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(57) Abstract 

The present invention is to haloaliphatic dehalogenase enzymes capable of converting halogenated aliphatic substrate molecules 
to vicinal halohydrins, as well as to DNA sequences encoding the polypeptide of the enzymes, to expression constructs containing this 
DNA, and to methods for producing the enzymes by placing the expression constructs into host cells under conditions sufficient for the 
tTBnsformants to produce the dehalogenase. A process for immobilizing the enzyme on a solid support and use of the immobilized enzyme 
for converting a halogenated aliphatic hydrocarbon to an alcohol is also disclosed. 
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RECOMBINANT HALOALIPHATIC DEHALOGENASES 

Large quantities of short-chain halogenated aliphatic hydrocarbons (HAHs) are 
produced for use as organic solvents , degreasing agents, pesticides, internnediates for the 
5 synthesis of various other organic compounds and as ingredients in the nnanufacture of 

plastics. The extensive use of these halogenated compounds in industrial processes creates 
a substantial opportunity for new technologies capable of upgrading and/or recycling low- 
value co-products. 

Excess HAHs produced as co-products in chemical manufacturing process can be 
!0 burned to produce heat and, in some cases, be recycled to low value starting materials, thus 
yielding some recovery from a waste product or excess co-product. In complex microbial 
environments (nature, water treatment plants, etc.), HAH degradation occurs by microbial 
biodegradation. Biodegradation of HAHs results in the formation of carbon dioxide, water, 
and hydrochloric acid when the halogen is a chloride. 

15 The biodegradation of HAHs to carbon dioxide, water, and hydrochloric acid by select 

microorganisms is disclosed in U.S. Patent Nos. 4,853,334 and 4,877,736. A process for 
the decomposition of chlorinated aliphatic hydrocarbons, without specifying the 
microorganism involved is described in U.S. Patent No. 4,749,491. In addition, the aerobic 
metabolism of trichioroethylene by Acinetobacter spp, has been reported by Nelson ef a/., 

20 AppL Environ. Microbiol., 52:383-384 (1986). An overview of the degradation of halogenated 
aliphatic compounds in the environment is given in Vogel etaL, Environ. Sci. TechnoL, 
21 :722-736 (1987). U. S. Pat. No. 5,372,944 discloses a Rliodoccocus species which 
produces a dehalogenase which converts HAHs to halohydrins. However, these references 
largely rely on cellular systems and do not take advantage of the benefits that may be 

25 obtained from the use of an Immobilized, activity-modified enzyme in a continuous feed 
process. Most relevantly, U.S. Patent No. 5,372,944 relies on Rfiodococcus cuWures 
comprising wild type or mutant cells. However, the mutation techniques taught therein do 
not take advantage of recombinant DNA methods and so fail to capitalize on the benefits 
these methods offer in terms of improvement in activity and expression of the dehalogenase 

30 enzyme. 

Rather than depend on biodegradation of HAHs by cell cultures, it would be 
advantageoL's to have an improved, recombinant enzyme that can be readily adapted to 
continuous-feed methods whereby the HAHs could be efficiently converted to valuable 
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intermediates for use in production of other useful products, such as chemical intermediates 
in the preparation of polyethers to form polyurethanes or in the preparation of glycols and 
polyglycols to form lubricants, surfactants, emulsifiers, etc. 

The present invention is directed to a recoHibinant enzyme, capable of converting 
HAHs to vicinal halohydrins, comprising an amino acid sequence substantially homologous 
with the amino acid sequence of residues 1-292 oi Figure 2. Another object of the invention 
is to provide DNA sequences encoding a polypeptide comprising such an enzyme, more 
specifically to DNA sequences comprising a polynucleotide substantially homologous with 
the nucleotide sequence of bases 37-912 of Figure 2. 

Another object of the invention is to provide a vector containing the DNA sequence(s) 
and a method for producing the polypeptide comprising placing the vector into a host cell 
and growing the host cell under conditions allowing the transformant to produce the 
dehaiogenase. 

Further objects of the present invention are to provide an immobilized form of the 
enzyme on a solid support as well as a process for converting a HAH to an alcohol or 
halohydrin comprising contacting the HAH with the immobilized enzyme. 

Brief Description of Drawinos 

Figure 1 illustrates a plasmid map of the vector pEXPROK. Plasmid pEXPROK is 
derived from the commercially available pPROK-1 plasmid (Clontech, Mountain View, CA) 
containing the Ptac promoter and the 5S, T1T2 terminator sequences. In the figure, the 
T1T2 region is indicator by "Term." This plasmid was generated by replacing the pPROK-1 
multiple cloning site with a pair of oligonucleotides which introduced restriction site Nco I, 
Hind 111, Xho I, Nhe I, and A/of I into the linker. The "ATG" sequence of the Nco I site 
represents a functional in-frame start site. The Nhe I site is followed by the EXFLAG linker 
sequence. The sequence of the EXFLAG linker corresponds to nucleotides 919-975 in 
Figure 2 and encodes amino acids 295-315 in the RDhl protein sequence shown in Figure 2. 

Figure 2 (i.e. Figures 2A and 2B) presents the nucleotide sequence encoding the 
putative Rhodococcus rhodochrous JDJM003 haloalkane dehaiogenase enzyme and the 
amino acid sequence derived from this nucleotide sequence. Amino acid residues 1-292 
correspond to the Rhodococcus dehaiogenase (RDhl) structural gene and are encoded by 
nucleotides 37-912. Amino acid residues -12 through -1 (nucleotides 1-36) represent a 
polyhistidine-containing amino-terminal tail, with residues -12 and -1 1 participating in the 
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formation of both the translational start site and the Nco 1 cloning site. Amino residues 293- 
294 (nucleotides 913-918) are encoded by the Nhe I cloning site and are followed by amino 
acids 295-305, which are referred to herein as the EXFLAG peptide. The EXFLAG linker 
(nucleotides 919-975) encodes the EXFLAG peptide and a dual-translational stop site (each 
5 indicated by an asterisk). 

Figure 3 illustrates a plasmid map of the vector pEXPROK-RDhl. 

Figure 4 (i.e. Figures 4A and 4B) presents an alignment comparison chart of the 
amino acid sequences of the putative Rhodococcus rhodochrous TDTM003 haloalkane 
dehalogenase, the Xanthobacter autotrophicus GJ10 dehalogenase, the Renilla reniformis 
10 luciferin monooxygenase, and the Pseudomonas spp. LinB gene product (a tetrachloro- 
cyclohexadiene hydrolase). 

Figure 5 presents a plasmid map of the vector pRDhl-K02.3-EXPROK comprising 
the putative Rhodococcus rhodochrous TDTM003 haloalkane dehalogenase gene under the 
control of the IPTG-inducibie Pfac transcription promoter. 

]5 Figure 6 illustrates a plasmid map of the high level expression vector pRSET-RDhI 

comprising the putative Rhodococcus rhodochrous JDTM003 haloalkane dehalogenase 
gene under the control of the T7 transcription promoter. 

Figure 7 illustrates a plasmid map of the high level expression vector pTrcHis-RDhI 
comprising the putative Rhodococcus rhodochrous TDTM003 haloalkane dehalogenase 

20 gene under the control of the trc transcription promoter. 

Figure 8 illustrates a plasmid map of the high level expression vector pTrxFus-RDhl 
comprising a modified version of the putative Rhodococcus rhodochrous TDTM003 
haloalkane dehalogenase gene fused to the gene encoding £. co// thioredoxin, the combined 
fusion gene being under the control of the P, transcription promoter. 

25 Figure 9 presents an image of an SDS-PAGE gel of cell lysate samples from cells 

expressing the pEXPROK-RDhI clone, compared to the partially purified rRDhI enzyme. 

Figure 10 presents an image of an anti-FLAG antibody immunoblot of an SDS-PAGE 
gel identical to that of Figure 9. 

Figure 1 1 presents an image of an SDS-PAGE gel of cell-free extracts from cells 
30 expressing pRSET-RDhl. 

Figure 12 presents an image of an anti-FLAG antibody immunoblot of an SDS-PAGE 
gel identical to that of Figure 1 1 . 

3 
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Figure 13 presents an image of an SDS-PAGE gel of cell-free extracts from cells 
expressing pTrcHis-RDhl. 

Figure 14 presents an image of an anti-FLAG antibody immunoblot of an SDS-PAGE 
gel identical to that of Figure 13. 
5 Figure 15 presents an image of an SDS-PAGE gel of cell-free extracts from cells 

expressing pTrxFus-RDhl. 

Figure 16 presents a productivity profile for an immobilized enzyme bioreactor acting 
on the substrate, 1 ,2,3-Trichloropropane. 

Figure 17 presents a bar chart of the activities of EPPCR-mutated Rhodococcus 
10 rhodochrous haloalkane dehalogenases. 

Figure 18 presents a bar chart of the activities of EPPCR-mutated Rhodococcus 
rhodochrous haloalkane dehalogenases. 

Figure 1 9 presents a graph of enzyme activity data for an RDhI enzyme bearing a 
carboxy-terminal S-Tag polypeptide tail and for an RDhI enzyme bearing a carboxy-terminal 
15 EXFLAG polypeptide tail. 

The present invention results from intensive research into obtaining a DNA sequence 
encoding a polypeptide having haloaliphatic dehalogenase activity from a microorganism 
belonging to the genus Rhodococcus, making recombinant DNA sequences by integrating 
the DNA sequence per se - or as modified - into a vector, and transforming a 
20 microorganism with the recombinant vector. Transformants were screened for 

dehalogenase activity levels and from those with heightened activity, the dehalogenase 
enzymes were isolated. Various solid support immobilization systems were then evaluated 
to identify enzyme-support combinations in which the enzyme could effectively convert 
halogenated aliphatic hydrocarbons to alcohols or halohydrins. 
25 Halogenated aliphatic hydrocarbons (HAHs) subject to conversion using the 

immobilized dehalogenase include C^-C,, aliphatic hydrocarbon molecules and groups which 
have two or more halogen atoms attached, wherein at least two of the halogens are on 
adjacent carbon atoms. Preferred HAHs are saturated hydrocarbons in which at least one of 
the halogens occupies a primary position on the molecule or group; more preferred are 
30 those in which no more than 1 halogen occupies the same carbon atom. Especially 
preferred HAHs are saturated hydrocarbons comprising 1 ,2-dihalo groups, examples of 
which are the 1,2-dihaloethane, 1,2-dihalopropane, 1 ,2-dihalobutane, and 1,2,3- 

4 
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trihalopropane molecules and groups. These classes include, for exannple, 1 ,2- 
dichloroethane, 1,2-dichloropropane, 1,2,-dichlorobutane, 1,2,3-trichloropropane, and 1,2- 
dibromo-3-chloropropane molecules and groups. 

As used herein, the term "halogen" means cnlorine, bromine, or iodine. The 
5 preferred halogens are bromine and chlorine. The most preferred halogen is chlorine and 
among the most preferred HAHs are volatile chlorinated aliphatic hydrocarbon (VCAH) 
molecules and groups; especially preferred VCAHs include 1 ,2-dichloropropane and 1 ,2,3- 
trichloropropane molecules and groups. 

As used herein, the term "halohydrin" means a vicinal halohydrin, i.e. any aliphatic 

10 organic compound, other than a carboxylic acid, which contains both a hydroxyl substituent 
and a halogen substituent on adjacent carbon atoms of the molecule. a,[J-halohydrins are 
the most preferred vicinal halohydrins. 

The terms "immunoblot" and "immunoblotting" are used herein to denote the process 
of: 1) transferring protein{s) from an electrophoresis gel, e.g., a polyacrylamide gel for use in 

15 PAGE, to a protein-binding membrane; and then 2) probing that membrane with an antibody 
specific to protein constituents that may be included among those transferred to the 
membrane; and then 3) determining the location of that antibody using any of various 
-chromogenic methods well known in the art, e.g.. developing color in a colorable marker 
which is directly or indirectly linked to the antibody. An example of an immunoblotting 

20 method is the Western blot. 

The terms "permeablize," "permeablizing," and "permeabtization" are used herein to 
denote the process of making something permeable, e.g., to make cell walls permeable. 
The term "sonicate" is used herein to denote the use of sonic waves to rapidly vibrate the 
contents of a test tube or other container, in order to thoroughly mix them. The term 'Vortex" 

25 is used herein to denote the action of mechanically gyrating a test tube along its bottom 
while manually holding the top of the test tube stationary, in order to mix its contents. 

The word "selectable" as used herein, means "able to be selected." For example, the 
phrase "selectable marker" or "dominant selectable marker" indicates a genetic feature, such 
as a gene encoding an antibiotic resistance enzyme, whose presence allows the gene's host 

30 cell to multiply in a corresponding selection medium, e.g., a growth medium containing that 
antibiotic. When such a genetic feature is incorporated into a plasmid containing a gene 
encoding a RDhI enzyme, and cells are then treated to receive the plasmid, growing the cells 
in a selection medium allows the cells actually receiving the plasmid to grow selectively, in 

5 
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contrast to those cells which did not receive or retain the plasmid. This permits the ready 
identification of cells which contain the RDhI gene. 

As used herein, the phrase "expression construct" denotes a plasmid, virus, virion, 
viroid, transposable element, cos-construct, transfectable carrier-associated DNA strand 
5 (e.g., a DNA-coated "gene-gun" pellet or DNA-coated natural or synthetic histone-like 
particle), or other DNA-to-cell delivery system which is known in the art. 

As used herein, in the context of describing amino acid sequences, the following 
single letter designations apply. 



A, a 


Alanine (Ala) 


M, m 


Methionine (Met) 


C,c 


Cysteine (Cys) 


N, n 


Asparagine (Asn) 


D,d 


Aspartic Acid (Asp) 




Proline (Pro) 


E, e 


Glutamic Acid (Giu) 


Q,q 


Glutamine (Gin) 


F, f 


Phenylalanine (Phe) 


R. r 


Arginine (Arg) 


G.g 


Glycine (Gly) 


S,s 


Serine (Ser) 


H, h 


Histidine (His) 


T,t 


Threonine (Thr) 


U 


Isoleucine (lie) 


V, V 


Valine (Val) 


K, k 


Lysine (Lys) 


W, w 


Tryptophan (Trp) 


L,l 


Leucine (Leu) 


Y,y 


Tyrosine (Tyr) 



20 As used herein, in the context of describing DNA sequences, the following single 

letter designations apply: 

A Adenine G Guanine N A, C, G,orT 

C Cytosine T Thymine R A or G 



Y C orT 



25 



The following abbreviations and definitions are used herein: 

@ At, agr., @37°C is "at ST^'C" and @60min. is "at 60 minutes" 
A Angstroms (one Angstrom is 1x10 '° meters) 
A Absorbance, e.g., A^^^ is "absorbance measured at 280nm" 
30 aa Amino acid 

Amp Ampicillin 
2-AMP 2-Aminopropanol 

AMPSO 3-[(1 ,1 -dimethyl-2-hydroxyethyl)amino]-2-hydroxy-propanesulfonic 
acid 

35 ATCC American Type Culture Collection (Rockville, MD, USA) 

base A nucleotide which is part of a polynucleotide, 
bp Base pairs 

CAPSO 3-(cyclohexytamino)-2-hydroxy-1-propanesulfonic acid 
CD Compact disc 
40 CHES 2-(N-cyclohexylamino)ethanesulfonic acid 

CM Carboxymethyt 
CnBr Cyanogen bromide 

A Change or difference, e.g., AA. is "change in absorbance" 
dATP Deoxyadenosine triphosphate 
45 DCS 1 ,4-Dichlorobutane 



6 



DCH 2,3-Dichloro-1 -propanol 

dCTP Deoxycytidine triphosphate 

DEAE Diethylaminoethyl 

dGTP Deoxyguanosine triphosphate 

dTTP Deoxythymidine triphosphate 

EDTA Ethylenediamine tetraacetic acid or ethylenediamine tetraacetate 

EPPCR Error-prone polymerase chain reaction 

GC Gas chronnatography 

GIA Glutaraldehyde 

gnn Grams 

hr Hours 

Hz Hertz (a measure of frequency in units of cycles per second) 

ID Internal diameter 

Ig Immunoglobulin, e.g., IgG is "immunoglobulin G" 

IPTG Isopropylthiogalactopyranoside 

lUB International Union of Biochemistry 

kbp Kilo-base pairs 

kD Kilo-Daltons (one Dalton weighs Vl 2 of a atom) 

K. Inhibition constant 

LB Luria broth 

|ig Micrograms 

|xL Microliters 

|iM Micromolar 

jimole Micromoles 

M Molar (moles of solute per liter of solution) 

mg Milligrams 

min. Minutes 

mL Milliliters 

mm Millimeters 

mM Millimolar 

MW Molecular weight 

N Normal (moles of chemically active solute groups per liter of solution, 

e.g., H2SO, has two acid hydrogens and so 1M H^SO, is a 2N solution) 

nm Nanometers 

ng Nanograms 

NP-40 Nonoxynol; p-(n-C,H,,)-CgH,-(OCH2CH,)pH; also called 

nonylphenoxypolyethoxyethanol (a non-ionic detergent surfactant) 

OD Optical density, e.g., 00^^^ "optical density measured at 600nm" 

oligo Oligonucleotide 

p_ Piasmid, e.g., pRSET, pTrcHis, pTrxFus, or pUC 

PAGE Polyacrylamide gel electrophoresis 

PGR Polymerase chain reaction 

PEI Polyethyleneimine 

pfu Plaque forming units 

phage Bacteriophage 

CAE Quaternized ethyl ammonium (an anion exchange group) 
RDhI Rhodococcus haloalkane dehalogenase enzyme 
residue An amino acid which is part of a polypeptide 
rpm Rotations per minute 

rRDh! Recombinant Rhodococcus haloalkane dehalogenase enzyme 
SDS Sodium dodecyl sulfate 
spp. Species 

7 
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TCP 1 ,2,3-Trichloropropane 
TM Trademark 

Tris Tris(hyclroxymethyl)aminomethane 
tRNA Transfer RNA 
U Units 

V^^^ Maximum enzymatic velocity 

% w/v Percent by weight per volume, Le. number of grams of solute 

per 

100 mL of solution, also written as "% (w/v)" 
% w/w Percent by weight per weight, i.e. number of grams of a substance per 
100 grams of a mixture containing that substance; also written as 
"% (w/w)" 
Approximately 

The following steps were carried out in the hope of obtaining an enzyme, and an 
immobilized enzyme, meeting the objectives of the present Invention. These steps were 
performed using techniques known to those skilled in the art: 

(1) isolation and partial determination of the amino acid sequence of a dehalogenase 
enzyme; 

(2) construction of oligonucleotide probes based on the partial sequence determination; 

(3) isolation of a dehalogenase-encoding DNA fragment by use of the oligonucleotide 
probes, followed by amplification the DNA; 

(4) ligation of the fragment into a cloning vector having a suitable origin of replication and a 
gene encoding a dominant selectable marker; 

(5) transformation and selection of a microorganism containing the recombinant plasmid; 

(6) transference of the DNA sequence to a suitable expression vector and using this 
recombinant vector to transform a host cell; 

(7) production of the recombinant dehalogenase by the transformant; and 

(8) purification of the dehalogenase; followed by 

(9) immobilization of the dehalogenase onto a variety of solid supports; 

(10) use of the immobilized dehalogenase in a process for conversion of HAHs to alcohols or 
halohydrins; and 

(11) selection of effective dehalogenase support systems. 

SiTprisingly, in the process of performing the above-outlined studies, novel 
recombinant dehalogenase enzymes were obtained that have performance characteristics 

8 
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superior to those of the wild-type enzyme from which the recombinant enzymes were 
derived. In addition, effective immobilized dehalogenase support systems were identified. 

The dehalogenase for use in the present invention is preferably derived from 
Rhodococcus species ATCC 55388 and is capable of converting a HAH to a halohydrin or 
5 alcohol, preferably a halohydrin. The preferred recombinant enzyme comprises an 

enzymatically active polypeptide comprising the minimal functional portion of the wild type 
dehalogenase enzyme, i.e. the smallest possible segment thereof which, after proper 
folding, retains haloaikane dehalogenase activity. Preferably, this polypeptide is 
substantially homologous with the amino acid sequence of residues 1-292 of Figure 2. More 

10 preferably, this polypeptide is at least about 90% homologous, even more preferably at least 
about 95% homologous, and yet more preferably at least about 99% homologous therewith. 
Especially preferred are enzymatically active polypeptides having the amino acid sequence 
of residues 1-292 or of residues of Figure 2. 

The preferred recombinant enzyme may also comprise one or more other units such 

15 as labels, tags, tails, linkers, solid supports, chelants, other enzymes, and so forth - 

regardless of their size - which may either be produced with or linked to the enzymatically 
active polypeptide of the enzyme after it is formed. Such units may be excised from the 
enzyme after it has been properly folded and/or immobilized upon a solid support. In a 
preferred embodiment, the enzyme is produced with or linked to a substantially hydrophilic 

20 tail. This tail may be a hydrophilic oligopeptide expressed as part of the enzyme or may be, 
e.g., an oligosaccharide moiety attached by the host cell to the core enzyme after expression 
thereof. The tail must be of sufficient length and hydrophilicity as to allow the core enzyme 
to remain in suspension in an aqueous medium. A preferred tail is a substantially hydrophilic 
oligopeptide expressed as part of the enzyme. More preferably, the enzyme is expressed 

25 with a highly hydrophilic oligopeptide tail. Most preferably, the oligopeptide tail is expressed 
at the carboxy terminus of the enzyme. A most preferred oligopeptide tail is a hydrophilic, 
carboxy-terminal tail which is rich in histidine and/or aspartic acid residues, especially one 
which is from about 5 to about 25 amino acids in length and contains at least about 25% 
histidine or aspartic acid residues, more preferably at least about 50% of such residues. 

30 The recombinant enzyme is preferably produced by a host ceil containing at least a section 
of a polynucleotide having the nucleotide sequence of bases 37-912 of Figure 2. 

The present invention is also directed to recombinant DNA sequences capable of 
expressing the enzymes of the present invention. These DNA sequences include those able 
to express the novel haloaikane dehalogenase(s) by means of translation systems not 
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following, or not fully following, the standard DNA code's codon-to-amino acid 
correspondence pattern. Such systems include those in which certain codons are 
"suppressed" relative to the standard DNA code. In one type of a "suppressed" expression 
system, at least one of the 20 or so amino-acid-specific classes of aminoacyl-tRNA ("aa- 
tRNA") molecules contains at least one tRNA molecule - having an anticodon belongir i to 
that class - which is linked to the '^A/rong" amino acid, so as to predispose the translation 
system to produce a "violation" of the standard DNA code {i.e. by causing the insertion, in at 
least one position in the growing polypeptide chain, of an amino acid not normally found in 
correspondence with the mRNA codon governing that position). In another variation on such 
a system, the pool of amino-acyl-tRNA molecules contains an aa-tRNA whose anticodon is 
complementary to an mRNA codon normally signaling initiation or termination of translation, 
thus suppressing the signal. These systems may exist, e.g., as a result of mutations in one 
or more tRNA molecules or aa-tRNA synthetases, a result of mistakes by non-mutated aa- 
tRNA synthetase(s), or a result of human intervention in forcing the non-standard linkage of 
an amino acid to a tRNA. 

In such a translation system, a DNA sequence of the present invention will still 
produce the novel haioalkane dehalogenase(s) either because the insertion(s) of the "wrong" 
amino acid do not cause the enzyme to lack activity or because the DNA sequence contains 
- at the position(s) where an "incorrect" amino acid would otherwise be inserted - a codon 
that "anticipates" the change in the translation system so as to allow either the insertion of 
the "correct" amino acid or the "correcf signaling of the mRNA codon therein. A preferred 
DNA sequence comprises a polynucleotide substantially homologous with the nucleotide 
sequence of bases 37-912 of Figure 2. More preferably, this polynucleotide is at least about 
90% homologous, even more preferably at least about 95% homologous, and yet more 
preferably at least about 99% homologous therewith. Especially preferred are 
polynucleotides having the amino acid sequence of bases 37-912 of Figure 2. 

As used herein, the phrase "substantially homologous" expresses the degree of 
similarity of a subject sequence - Le. a subject nucleotide sequence (of an oligo- or poly- 
nucleotide or DNA strand) or a subject amino acid sequence (of an oligo- or poly-peptide or 
protein) - to a related, reference nucleotide or amino acid sequence. This phrase is defined 
as at least about 75% "correspondence" {i.e. the state of identical elements - nucleotides or 
amino acids - being situated in parallel) between the subject and reference sequences when 
those sequences are in "alignment." In this context, "alignment" is said to exist when a 
minimal number of "null" elements have been inserted in the subject and/or reference 
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sequences so as to maximize the number of existing elements in correspondence between 
the sequences. "Null" elements are not part of the subject and reference sequences; also, 
the minimal number of "null" elements inserted in the subject sequence may differ from the 
minimal number inserted in the reference sequence. Increased degrees of homology of a 
given sequence, which may be expressed as, e.g., "90% homologous," are likewise defined 
with reference to their degree of sequence identicality to a reference sequence. 

In this definition, a reference sequence is considered "related" to a subject sequence 
where either: 1) both nucleotide sequences encode proteins or portions of proteins which 
may be identified to the same lUB subclass or 2) both amino acid sequences make up 
proteins or portions of proteins which may be identified to the same lUB subclass, regardless 
of whether such identification is based on functional properties, sequence homology, or 
parental origin. "Parental origin" refers to the fact that a given enzyme may initially be 
grouped within an lUB subclass because of its recognized major or minor function(s), but 
after the DNA sequence encoding that enzyme accumulates one or more mutation(s). the 
encoded enzyme may exhibit functional capacities of a different lUB subclass - whether or 
not the enzyme also retains its original functionality; the "different lUB subclass" may fall 
within the same or a different lUB main class. The reference to "portions of proteins" 
signifies that bi- and multi-functional enzymes - including fusion proteins - are also 
contemplated as falling within a given lUB subclass based on the identification to that 
subclass of one of their functional domains. 

In a preferred embodiment of the present invention, the haloalkane dehalogenase at 
least parentally belongs to lUB sub-subclass 3.8.1 . The enzymes of the present invention 
have been found to possess unexpectedly superior properties to those of the wild-type 
haloalkane dehalogenase enzyme found in Rhodococcus as, e.g., was utilized in U.S. Patent 
No. 5,372,944. Generally, aside from its stability under reaction conditions, two 
characteristics of a given enzyme will determine its usefulness in commercial processes: its 
affinity for product, as well as its affinity for substrate. Where an enzyme's affinity for 
product molecules is relatively high, it will be extremely sensitive to feedback inhibition by the 
product. Such an enzyme Will be less useful in commercial processes in which enzymes are 
often required to operate in the presence of significant product concentrations. A convenient 
indicator of an enzyme's relative affinity for product is its inhibition constant measured at 
90% inhibition ("Kj(90)"), i.e. the product concentration at which the enzyme retains only 10% 
of its Vmax, the Vmax being measured when the concentration of product is 0. In regard to 
the present invention, whereas the wild type haloalkane dehalogenase has a measured 
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K,(90) of 20 mM, the recombinant enzyme of the present invention (see Figure 2) has a 
measured K,(90) of 50 mM. In other words, the recombinant enzyme is much less sensitive 
to feedback inhibition by product and can therefore operate in the presence of product 
concentrations that would essentially shut off the wild type enzyme altogether. 

5 The enzyme of the present invention may be expressed alone, or covalently 

attached, along its amino and/or carboxy terminus, to one or more polypeptide tail(s). Such 
tails may be encoded by exons separate from the enzyme-encoding exon or by DNA 
sequences which are part of the enzyme-encoding exon. When the tail-encoding DNA is to 
be part of the enzyme-encoding exon, the tail-encoding DNA may be attached or *1used" to 

10 the 3' and/or 5' end of the enzyme gene, e.g., either: 1) during enzyme gene amplification 
by including the tail-encoding nucleotide sequence in an oligonucleotide primer or 2) during 
plasmid construction by ligating the tail-encoding DNA directly into a plasmid which contains 
the enzyme gene (whether the enzyme gene is inserted into the plasmid before or after 
insertion of the tail-encoding DNA), 

15 Under the influence of the appropriate genetic control elements - i.e. enhancers, 

promoters, transcription and translation start and stop sequences, and so forth - expression 
of such DNA (or mRNA) fusion genes results in production of dehalogenase enzymes with 
polypeptide tails on one or both ends. An example of a preferred tail-free enzyme is that 
having the amino acid sequence of residues 1-292 of Figure 2. Examples of some preferred 

20 polypeptide tails include poly-histidine sequences, polyacid {e.g., poly-aspartic and/or - 
glutamic acid) sequences, cellulose binding domains, and the c-myc, S-Tag, and FLAG 
peptides. Antibodies and affinity columns that bind these exemplary tails are commercially 
available and may be readily used to purify or immobilize the expressed fusion proteins. 
However, many other tails may be used while retaining a functional dehalogenase enzyme. 

25 Whether or not a tail-encoding sequence is included in the expressed gene, the gene must 
include, in a position outside the enzyme gene or the enzyme-tait fusion gene, a translation 
start site, preferably ATG, and will also preferably include an endonuclease restriction site. 

In one preferred embodiment, the open reading frame of a single exon encodes a 
functional dehalogenase enzyme having tails of up to about 30 amino acid residues on the 

30 amino and/or carboxy termini. In this embodiment, when both termini have tails, the tails 
may be of approximately equal length. In another preferred embodiment, the enzyme is 
expressed with both an amino and a carboxy terminal tail, but the carboxy terminal tail is 
significantly longer than the amino terminal one. In this embodiment, preferably the amino- 
terminal tail is up to about 25 amino acids in length and the carboxy-terminal tail is about 2 to 
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about 150 amino acids in length. In any of these embodiments, preferably, the amino- 
and/or carboxy-terminal tail will contain a stretch of at least 5 adjacent histidine residues. In 
an alternate embodiment, the amino terminal tail is about 10-150 amino acids in length and 
preferably contains or is itself a poly-histidine sequence. In this embodiment, the enzyme 

5 may be reversibly immobilized or reversibly inactivated by contact with a surface coated vith 
chelated divalent metal ions, e.g., Mg'' or Nf\ In this embodiment, the poly-histidine- 
containing amino-terminal tail may be so long as to partially or totally block access to the 
enzyme's active site. In an alternate version of this embodiment, the tail may be designed to 
contain one or more amino acid residues which change the configuration of the tail from that 

10 found in a poly-histidine sequence to a bent, recurved, or flexible-joint configuration allowing 
increased access to the active site of the enzyme. 

In a more preferred embodiment, the open reading frame encodes a functional 
dehalogenase enzyme with an amino terminal tail of about 1 to about 25 amino acids and a 
carboxy-terminal extension having a polyhistidine sequence, a FLAG peptide sequence 

15 (available from KODAK Imaging SystemsA/WR, Rochester, NY) and/or an S-Tag peptide 
sequence. In an especially preferred embodiment, the open reading frame encodes a 
functional dehalogenase enzyme having: 1) an amino-terminal tail of up to about 10 amino 
acids and a polyhistidine sequence and 2) a carboxy-terminal tail comprising {i.e. containing) 
the FLAG (see Figure 2) or S-Tag peptide sequence. 

20 The enzymes and/or tails of the above-described dehalogenase enzymes may be 

modified by use of the techniques of directed evolution, in order to improve their productivity, 
stability, and/or inhibition profiles. One directed evolution technique uses the gene shuffling 
method disclosed in U.S. Patent No. 5,605,793 to Stemmer etal., in which a number of 
similar DNA sequences are fragmented and reassembled in a random fashion to generate 

25 highly diverse libraries which can be screened for enzymes with the attributes of interest. 
Another version of this technology involves use of error-prone gene amplification 
technologies. A third version of directed evolution employs a combination of these two 
methodologies. A fourth version of directed evolution Is the so-called "staggered extension" 
process as disclosed in the publication by Zhao etal., in Nature Biotechnology 0998) 

30 (currently in press). In a preferred embodiment, error-prone gene amplification is used to 
introduce semi-random mutations into the dehalogenase gene (e.g.. Figure 2, residues 1- 
292) at a rate of about 1-6 point mutations per gene copy per gene amplification reaction, 
following whi:;h the mutant library is introduced into bacteria, induced to express protein, and 
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screened for activity, preferably in a spatially addressable grid format (such as a 96 well or a 
384 well plate). 

Effective use of directed evolution to improve an enzyme or enzyme family requires 
an optimized mutagenesis strategy as well as an expression system and a screening 
5 strategy and screening conditions which effectively detect the desired performance attributes 
of the enzyme. For (non-random) primer-dependent mutagenesis methods (e.g., error- 
prone gene amplification and defined primer-based recombination), specific protein 
subdomains can be easily targeted for mutagenesis by primer design and positioning. In a 
preferred embodiment, primers are used which allow mutagenesis of the entire transcription 

10 and translation domain as it occurs within the expression construct. Preferably, primers are 
directed exclusively to the protein coding region of the expression construct or target DNA 
(including tails). In a more preferred embodiment, primers are designed in such a way as to 
target mutagenesis to the dehalogenase enzyme gene while preserving the sequence of the 
tails. For example, in relation to Figure 2, the dehalogenase enzyme gene may be the sole 

15 mutagenesis target when an error-prone gene amplification technique employs both a primer 
complementary to nucleotides closely preceding nucleotide 36 and a primer complementary 
to nucleotides closely following nucleotide 912. Likewise, the entire Figure 2 coding region is 
the mutagenesis target when the primers anneal outside of the region of nucleotides 1-951 ; 
the Figure 2 amino tail or carboxy tail, respectively, is targeted when the primers anneal 

20 outside of the region of nucleotides 1 -36 or 91 3-951 . 

The DNA sequence(s) encoding the enzyme or fusion protein of the present invention 
will preferably be inserted into an expression vector, followed by transfection of the vector 
Into a host cell, and growth of the host cell under conditions in which it expresses the 
enzyme. A wide variety of recombinant host-vector expression systems for prokaryotic cells 

25 are known and may be used in the invention. For example, commercially available vectors 
such as pKK233-2, pKK388-1, pSE380, pTrcHis (A, B, and C), pRSET (A. B, and C), 
pProEX-1, and bacteriophages Lambda (gt1 1), T3, and T7 are all capable of directing 
expression of heterologous proteins in Escherichia coli and other gram-negative prokaryotes. 
In these expression formats, a variety of strain-appropriate inducible promoters can also be 

30 used. In addition, other prokaryotes (such as those of the genus Bacillus, Pseudomonas, 
Actinomyces, Bacillus, or Rhodococcus), eukaryotic microorganisms (such as yeast and 
fungi, e.g., those of the genus Pichia, Saccharomyces, or Aspergillus, e.g., Pichia pastoris or 
Saccharorr.yces cerevisiae), other eukaryotic cells and cell lines (such as Sf21 cells infected 
with baculouvirus-derived vectors), and even algal cells are capable of producing, in active 
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form, heterologous proteins of prokaryotic origin; in the event these other cells are utilized in 
the present invention, appropriate expression vectors would be selected for use therewith. 
Whereas numerous prokaryotic expression vectors are available publicly and may be used in 
the present invention, expression of the novel enzymes is exemplified herein with the use of 
5 commercially available vectors from the pTrcHis, pRSET and pTrxFus series (available from 
Invitrogen of San Diego, CA, USA) in conjunct. on with E. co// host cells. 

When a directed evolution technique, such as error-prone gene amplification (e.g., 
error-prone PGR or "EPPCR), is employed, the DNA of the mutant gene pool produced 
thereby is digested with appropriate restriction enzymes {i.e. those endonucleases having 

10 restriction sites located external to the mutagenesis target); next, the mutant genes are 

purified and iigated into prokaryotic expression vectors to form a plasrnid library. Competent 
host cells, e.g., preferably E co// cells, are then transformed with the plasrnid library and 
grown in a suitable medium: in the case of E. coli, the cells are plated on agar containing a 
selective growth medium. The celts may then be diluted to form individual clones, or in the 

15 case of prokaryotes such as E. co//, they may undergo an initial growth phase, after which 
the cell colonies are picked individually and transferred to separate containers, e.g., the wells 
of a 96 well plate, such that each well contains an individual clone of transformed cells. 
From this library of clones, individual clones can be expanded, induced to express the 
protein of interest, and screened for the activity of interest. 

20 Screening for the haloaliphatic dehalogenase activity of the novel enzymes is 

preferably accomplished by detecting the protons or the halide ions released upon hydrolysis 
of a carbon-halogen covalent bond in a substrate molecule, in a preferred embodiment, the 
pH change accompanying the proton release serves as a measure of enzyme activity; this 
pH change is preferably determined using a fluorescent or visible pH indicator which 

25 undergoes measurable color change over the functional pH range of the target enzyme. In 
an alternate method, multiple parallel pH probes may be utilized. 

In the activity screening assay, the assayed mixture will contain: 1) whole cells, 
permeablized cells, cell lysate, or purified enzymes obtained from cells expressing a mutant 
dehalogenase, preferably from bacterial cells; 2) a substrate; and 3) a low concentration of 
30 buffer (typically < 10 mM). When use of permeablized cells is desired, a chemical detergent 
(e.g., sodium deoxycholate) or a physical freeze/thaw process may be used to make 
bacterial cells permeable. The substrate will preferably comprise one or more halogenated 
aliphatic hydrocarbons as discussed above. The buffer may be selected from any known to 
be effective or found to be effective over the pH range in which the enzyme retains activity. 
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In some cases, the cell debris itself will be seen to provide sufficient buffering capacity to 
allow accurate quantitation of activity. Where an added buffer is used, it will preferably have 
a pKa in the range of about 6 to about 10, although other buffers may be used. Examples of 
preferred buffers include glycine, 2-AMP, CAPSO, ethanolamine, CHES, borate, serine, and 
5 AMPSO; especially preferred is CAPSO and even more preferred is a concentration of about 
5mM CAPSO. 

The activity screening assay will also require the use of a detection method. In a 
preferred embodiment, a pH change is detected. Preferably, a pH indicator will be included 
in the assayed mixture. Any pH indicator having a color change in the pH range in which the 

10 enzyme is active may be used. Preferably, the pH indicator will undergo a color change in 
the range of about pH6 to about pH10, more preferably in the range of about pH7 to about 
pH9. Examples of preferred visible pH indicators include m-cresol purple, cresol red, phenol 
red, bromthymol blue, and thymol blue; examples of preferred fluorescent pH indicators 
include a-naphthol sulfonic acid, 1,4-naphthol sulfonic acid, coumaric acid, 3,6-dioxyphthalic 

15 dinitrile, and orcinaurine. In an alternative embodiment, a pH probe may be utilized to detect 
the pH change. Especially preferred is the use of the visible pH indicator, m-cresol purple, 
and even more preferred is a concentration of about 50|iM m-cresol purple. 

In another preferred embodiment, detection is accomplished by measuring the 
release of halide ions from the substrate by: 1) including in the assayed mixture a halide- 
20 sensitive fluorescent dye, such as lucigenin (available from Molecular Probes of Eugene, 
OR, USA) - lucigenin is quenched upon contact with halide ions and so a decrease in 
fluorescence in measured therefrom; or 2) utilizing a halide ion responsive probe device, 
such as a halide-selective electrode. 

In a third preferred embodiment, detection of enzyme activity is accomplished using a 
25 coupled enzyme system. For example, a coupled enzyme system may be used to detect the 
production of product molecules: dehalogenation of haloalkanes results in generation of 
alcohols, and many alcohols are substrates for one or more commercially available alcohol 
dehydrogenase enzymes (whose activity is measured by disappearance of NADH). 
Detection of alcohols via coupling to the NADH requirement of the dehydrogenases is well 
30 known in the art. 

The enzymes of the present invention may be immobilized onto one or more solid 
support{s). Enzyme immobilization technologies are most conveniently classified into 
covalent and non-covalent methods. Covalent methods utilize reactive groups present on 
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certain amino acid side-chains to bond to a polymeric or inorganic support either directly or 
by using a bifunctional cross-linking agent. The primary advantage of this approach is the 
robustness of the linkage. Non-covalent immobilization methods are more numerous and 
range from direct and indirect {e.g., chelate- or chelant-mediated) ionic, adsorptive, or 
bioaffinity support associations (e.g., biotin-avidin) to gel-entrapment or microencapsulation. 

The choice of a particular immobilization technology for a commercial enzyme 
process is based on a combination of factors. Of primary importance are the cost of the 
support matrix and its biocompatible linking or coupling chemistries. Next are the recovery 
of activity upon immobilization and the robustness of the immobilized support under reaction 
conditions. Unfortunately, since each enzyme is unique, approaches to finding the best 
system are empirical. However, in conjunction with the enzymes of the present invention, a 
preferred method of immobilization involves covalently linking the enzyme to the support by 
means of reactive groups such as epoxides, activated nucleophiles, isourea, and so forth. 
These reactive groups may be present on the native surface of the support material or the 
support material may be modified to bear linkers containing such groups. Preferred linkers 
include those comprising at least one of: dialdehyde, diacid, diamine, diisocyanate, cyanate, 
and diimide groups; linkers comprising at least one carbodiimide group may also be used, 
provided that a diamine group is not used in conjunction with a carbodiimide. Among the 
preferred solid supports are alumina-based supports and silica-based supports; more 
preferred are polyethyleneimine-impregnated alumina- or silica-based supports. A preferred 
method of immobilization comprises pre-treating the solid support with glutaraldehyde and 
then contacting the support with the enzyme. 

Once immobilized, the enzyme may be conveniently used to convert its 
substrate/reactant into product. This conversion can be performed in any suitable medium 
which does not substantially affect the activity of the dehalogenase. Preferably the 
enzymatic conversion is done in a aqueous medium containing either a buffering system or 
one or more pH-control devices. 

The halogenated hydrocarbon substrate is generally added to a reaction medium to 
the saturation point of the substrate, though in some cases, supersaturated substrate 
mixtures, substrate emulsions, or pure substrate preparations may also be used. Given the 
saturation point of most halogenated hydrocarbon substrates, the concentration of 
halogenated hydrocarbon used will generally range from about 0.005% to about 0.5% (w/v). 
Preferably, the concentration of the halogenated hydrocarbon is from about 0.005% to about 
0.25%. More preferred is a concentration of halogenated hydrocarbon from about 0.005% to 
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about 0.2% in medium. The substrate may be added to the reaction solution initially, as in a 
batch method, or be added into the liquid stream of a continuous feed process. In such 
continuous feed processes, the liquid stream may initially contain substrate or the substrate 
may be first added thereto as the stream is en route to the reactor. In either case, more 
5 substrate may be added directly to the liquid stream in the reactor in order to ensure that a 
high concentration of substrate is presented to the enzyme throughout the reactor. The 
liquid stream may be re-saturated with substrate at various intervals in the process in order 
to enable accumulation of product at concentrations higher than the solubility limits of the 
substrate. The batch method reaction is usually carried out with shaking or stirring. 
10 Although the reaction time or reactor residence time may vary depending on the reaction 
conditions, such as the substrate concentration or the amount of enzyme, the reaction 
conditions are preferably selected so that the reaction is completed within a maximum of 120 
hours. 

The invention will be further clarified by a consideration of the following examples, 
15 which are intended to be purely exemplary of the present invention. All percents are percent 
by weight unless othen^/ise indicated. 



General Experimental 
Materials and Media: 

20 All oligonucleotides were synthesized and purified by Genosys Biotechnologies Inc. 

(Woodland, TX), Life Technologies, Inc. (Rockville, MD) or Integrated DNA Technologies, 
Inc. (Coralville, lA). Restriction enzymes and DNA modifying enzymes were purchased from 
Gibco-Bethesda Research Laboratories (Gaithersburg, MD), New England Biolab Inc. 
(Beverly, MA), or Stratagene Cloning Systems (La Jolla, CA) and were used according to 

25 manufacturer's protocols. Competent E. co// AG1 cells were purchased from Stratagene 
Cloning Systems, Competent E co// JM109 cells and TOP 10F cells were purchased from 
Invitrogen Corp. (San Diego, CA). Small scale plasmid DNA isolations were done using the 
Rapid Pure Miniprep (RPM^^) system (BIO 101, Inc., La Jolla, CA). DNA ligations were 
performed with pre-tested reagent kits purchased from Stratagene Cloning Systems. 

30 Purification of DNA fragments was with either QlAquick Gel Extraction Kits and QIAquick 
PGR Purification Kits both purchased from Qiagen Inc. (Chatsworth, CA). SDS- 
polyacrylamide gels and associated buffers and stains, as well as electroblot transfer 
buffers, came from Integrated Separation System (ISS, Natick, MA). Antibodies, anti- 
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FLAG^" monoclonal antibody M2, and goat anti-mouse lgG1 were obtained from 
Intemational Biotechnology Inc. (IBI. New Haven, CT) and Southern Biotechnology 
Associates (Birmingham, AL), . respectively. Bacteria were cultured in Luria-Broth ("LB") 
using premixed reagents purchased from Gibco-Bethesda Research Laboratories (G-BRL; 
5 Gaithersburg, MD). 

Reagents 

1,4-Dichlorobutane, 60% perchloric acid, ferric nitrate, and mercuric thiocyanate were 
fromAldrich. Anhydrous ethanol was from Quantum/USI (Tuscola JL, USA). 1,2,3- 
10 Trichloropropane was a gift from The Dow Chemical Company's Allylics Group (Freeport, 
TX, USA). Monobasic potassium phosphate, dibasic potassium phosphate, imidazole, 
guanidine hydrochloride, disodium EDTA, ammonium sulfate, and Tris free base were Fisher 
Biotech Grade. Sulfuric acid was from Fisher (ACS grade). 

Support Materials 

The Tresyl-Toyopeart chromatography support was from TosoHaas (Lot # 
65TRM72R). Sephadex G-25 prepackaged columns were from Pharmacia. Celite R-648 
was from Manvitle. Polyethyleneimine, 50,000 MW and PEI-silica were from Sigma. 
Glutaraldehyde, Grade 1, as 25% aqueous solution, also from Sigma, was stored at •20°C 
until just prior to use. Other samples used in immobilization include: Davison Low SA 
Alumina, Norton SA 6176 Alumina, Calcicat Type C Alumina, Calcicat s-88-473 Type A 
Silica, Shell 5980-F Silica, Davison 952-08-5X Silica, Borecker subunit Carbon, and 

AmCy 5701 -Sn Carbon. 

25 Methods: 

PGR Reactions 

DNA Amplification was performed using standard polymerase chain reaction buffers 
supplied by Perkin-Elmer-Cetus (Nutley, NJ). Typically, 50 ^L reactions include 1x 
concentration of manufacturer supplied buffer, 1.5 mM MgCI^, 125 ^M dATP, 125 ^M dCTP, 
30 125 ^M dGTP, 125 ^iM dTTP, 0.1-1.0 |iM forward and reverse primers, 5U AmpHTaq DNA 
Polymerase and <1 ng target DNA. Unless otherwise indicated, thermal profile for 
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amplification of DNA is for 35 cycles of a thermal profile of 0.5 min. @94X; 1 min. @55°C; 1 
min. ©72*^0. 

Protein Detection bv Polyacrylamide Gel Electrophoresis 

Soluble protein was mixed 1:1 with solubilization buffer (Tris/SDS/B-mercaptoethanol, 
pH 6.8; ISS) and boiled for five minutes before being loaded on 10-20% gels (Daiichi, Natick. 
MA) and electrophoresed with Tris-glycine buffer (ISS). Gels were stained with Pro-Blue^^ 
(ISS). 

Standard Chloride Detection Assay to Determine Units of Enzyme Activity 

When using 1,4-dichiorobutane (Aldrich) as a substrate, 100 mM NaGlycinate pH 9 
was added to each 9 mL capped viat to a final volume of 6 mL. When using 1,2,3- 
trichloropropane as a substrate, 10 mM TrlsSulfate/1 mM EDTA (pH 7.0) was used. Six pL 
substrate were then added and the contents were vortexed. Vials were incubated at 30^*0 
for 1 hour with stirring. Sampling occurred at 5 time points by removing 1 mL of mixture and 
placing it in an Eppendorf tube containing 100 pL 0.375 M Fe'*(N03)3 in 5.25 M HCIO,. 
Tubes were vortexed. When all samples had been collected, 100 pL mercuric(ll) 
thiocyanate saturated in ethanol was added to each of the tubes. Once again, samples were 
vortexed, then centrifuged for 3 minutes. Optical densities were read at 460nm. Slopes 
representing change in absorbance over time (AA/min) were determined and divided by 1 .52 
(the extinction coefficient at 460nm using NaCl as standard in units of AA/pmole CI") to give 
pmole CIVmin. One unit of enzyme activity is defined as the amount required to 
dehalogenate 1 .0 pmole of substrate/minute under the specified conditions. 

Procedure for Error-Prone PGR Mu taoenesis 

In this directed evolution procedure, an RDhI enzyme gene or RDhI fusion protein 
gene was provided as an EPPCR mutagenesis target, e.g., by using appropriate restriction 
enzymes to digest a plasmid containing the target DNA sequence. In most cases, the target 
DNA was purified by gel electrophoresis, followed by gel extraction of the target DNA. 
EPPCR involved performing a standard PGR gene amplification of the target gene, using 
appropriate oligonucleotide primers, except that the standard PGR buffer was supplemented 
with sufficient magnesium chloride and manganese chloride to bring the reaction mixture to 7 
mM magnesium chloride and 0.15 mM manganese chloride. This procedure may be 
repeated upon one or more of the EPPGR products to introduce further mutations therein. 
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15 



The resulting EPPCR products were ligated into expression vectors (e.g„ pTrcHis. 
pTrxFus) and the vectors were then used to transform appropriate, competent host cells, 
e.g., E. CO// AG1 or JM109 ceils, for enzyme expression and enzyme activity analysis. 
Plasmid-containing clones were identified by selective growth on LB/Amp agar plates. 
5 Individual colonies were transferred by toothpick into the wells of a 96-well plate containing a 
selective growth medium and incubated at 37°C for -8-1 2hr to allow for growth. Following 
the initial growth phase, replica plates were generated, expanded, and individual ::lones 
thereof were assayed for dehalogenase activity as described in the following section. 

PrnrpHurf^ for Measuring RDhI Enzvme Ac tivitv bv Detection of pH Change 

jQ RDhI enzyme activity was measured by detecting the pH change resulting from 

action of the enzyme in dehalogenating substrate. Prokaryotic host cells expressing the 
enzyme were grown in broth, quantitated, and permeablized prior to addition of a pH 
indicator, buffer, and substrate. 

Each well of a 96-wel! microplate received 200|iL of an SOB broth (obtained from 
Difco, Detroit, Ml, USA) which had been supplemented with about 50-100|ig/mL of ampicillin 
("SOB/Amp"). Cells from a single colony of enzyme-producing E. coli clone were inoculated 
into one well of the plate. When testing a library of rRDhI enzymes or rRDHL fusion 
proteins, each well was inoculated with cells from a different E. co// clone. Six wells received 
no cells, in order to serve as a negative control, and six additional wells were inoculated with 
an E. CO// clone producing the wild-type RDhI enzyme, as a positive control. The inoculates 
were incubated overnight in a Psycrotherm oven at 37X while being shaken at 250 rpm. 

After incubation, the cultures were induced by addition of IPTG to a final 
concentration of 1mM, followed by another 5 hours of incubation at 37°C in a Psycrotherm 
oven with 150 rpm shaking. After the 5 hour incubation, the cell density of each culture was 
25 determined by use of a 1 .573 Vmax/Kinetic Microplate Reader (Molecular Devices, 

Sunnyvale, CA, USA). 20^L aliquots of each of the induced cultures were then transferred 
the wells of a fresh 96-well plate and 2.2^L of pH8.0, 10x permeablization buffer (10 mM 
sodium deoxycholate, 1% NP-40, 50 mM Tris, and 50 mM EDTA) was added to each 
aliquot, followed by shaking for 3-5 min. at moderate shaking speed. Each of the cell culture 
30 aliquots then received 200^1 of a DCB-saturated buffer (>1 ^L DCB/mL buffer system), at pH 
9.2-9.5, which contained 5mM CAPSO and lOO^M cresol purple. The developing color 
change of the indicator was measured by use of a SpectraMaxPlus microplate reader 
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(Molecular Devices, Sunnyvale, CA, USA) and the slope of the color change was plotted to 
extrapolate the initial enzyme activity. 

Example 1 

Isolation of dehalooenase enzyme from Rhodococcus . 

5 Rhodococcus species ATCC 55388 was cultured as described in U.S. Patent No. 

5,372,944. An enzyme extract was prepared from this culture as generally described in U.S. 
Patent No. 5,373,944 by taking a 25-75% ammonium sulfate cut, two ion exchange 
chromatography steps (1. DEAE-Sephadex; 2. DEAE-Sephacryl) in which the salt 
concentration was varied over the range of 0-400 millimolar sodium sulfate in the form of a 
10 gradient, gel filtration chromatography using Sephadex G-75, and then concentration by 

ultrafiltration to obtain an enzyme preparation containing greater than 65% dehalogenase by 
SDS-polyacrylamide analysis. 

A portion (-25 mg) of the purified enzyme was subjected to cyanogen bromide 
digestion. Peptide fragments were isolated using an RP-8 Macrosphere (Altech) mixed mode 
15 cation column with a 0-80% acetonitrile/water gradient containing 0,1% trifiuoracetic acid. 

Three purified protein and purified cyanogen bromide (CnBr) fragments were 
subjected to sequencing by automated Edman degradation. The sequences of the N- 
terminus and three CnBr fragments were determined. One of the CnBr fragments was 
identical to the N-terminus in sequence. The other two corresponded to unique internal 
20 dehalogenase sequences. Sequences of all the peptides are shown in Table 1. 

Table 1: Sequences of N-terminal and Proteolvtic F ragments Derived from Purified 

Rhndnnnccus Dehalogenase. 
N Terminal Sequence: 
SEIGT GFPFD PHYVE VLGER 
Cyanogen Bromide Fragment Sequences: 

1. HYVDVGPRDG 

2. DHYRE PFLKP VDRE 



DMA Primer Design 

Primers RDhI 5.4 and RDhI 3.12 were designed to allow amplification and cloning of 
the open reading frame encoding the Rhodococcus dehalogenase (RDhI) gene in expression 
system pEXPROK. The sequence of RDhI 5.4 was derived from the N-terminal sequence of 
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the protein whereas RDhI 3.12 was designed based on the actual DNA sequence. Primers 
RDhI 5.7 and RDhI 3.13 were designed to generate an RDhI gene in expression system 
pRSET and pTrcHis. Primers Trx2++ and Trx- were designed to generate an RDhI gene in 
expression system pTrxFus. 

The sequences of these oligonucleotide primers are as follows: 
Table 2: Sequences and Orientation of Olioonucleotide P rimers Used in Cloning or the 
Rhodococcus Dehaloaenase 



Oligo Name 


Orieniation 


Design 
based on 


Sequence* 


RDhI 5-4 


Fonvard 


N- 

terminal/homology 


5 ' GGTTCCATGGGNTT { CT ) CCNTT { CT ) GA ( CT) CCNCA ( TC ) TA 


RDhI 3.12 


Reverse 


.^'-Sequence Data 


5 • B i Q - CAGAGCTAGC GAGTCCGGGGAGCCAGCG 


RDhI 5.7 


Forward 




5 ' CGTA£AlAISG£mi:SfiGG GGT TCT CAT CAT CAT 
Nde 1 Nco 1 G G S H H H 
CAT CAT CAT GGT ATG TCT GAA ATA GGT ACC 
H H H 

GGT TTT CCC TTC GAC CCT CAT TA-3 ' 


RDhI 3.13 


Reverse 




5 '-GAT GAC AAA TAA TGA GCG GCC gCA hQQ TTG TAC-3' 
Not 1 Hind III 


Trx2++ 






^-.rr r,r,G hat CCC ATG GCT TCT GAA ATA GGT ACC GGT 
BamH I Nco I 

TTT CCC TTC GAC CCT CAT TA-3' 


Trx- 






^'-TCn ACT GCA GGC GGC CGC TCA TTA TTT GTC ATC-3' 
Pst I Not I 



*Bio=Biotin; N=A, C, G, or T; , . . ^ j ^ 

position Underlined sequences correspond to 5' sequences intended to 
10 introduce, into the amplified DNA products, restriction sites compatible 
with the intended cloning vector (pEXPROK) . 

ninnin q of the Partial Rhodoco ccus Dehaloaenase Genes 

Cloning of the Rhodococcus dehalogenase gene was accomplished by amplification 

15 from a genomic DNA library as follows. Genomic DNA was isolated from the Rhodococcus 
ATCC strain 5538 using the methods of P.J. Asturias and K. Timmis (J. Bacteriology 
175:4631-4640 (1993)). Purified genomic DNA (100 \xg) was sheared mechanically to an 
average size of <10 kbp. Fragments were ligated to BamH i linkers, followed by BamH I 
digestion and ligation Into a BamH I digested preparation of bacteriophage Lambda-ZAP 

20 Express^^ DNA (obtained from Stratagene, Inc. of LaJolla, CA, USA). A library containing 
the genomic Rhodococcus DNA fragments was prepared commercially (Stratagene, Inc., 
LaJolla. CA) and supplied at a titer of 1x10' pfu/^L (plaque forming units per microliter). A 
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redundant DNA primer (RDhI 5.4) corresponding to the codons for amino acids 6-13 of the 
N-terminal sequence was synthesized using solid phase phosphoramidite chemistry and 
purified by HPLC (Table 2). 

The RDhI 5.4 primer was used in combination with a commercially available primer 
5 which recognizes the T3 bacteriophage promoter sequence (and is contained within the 
Lambda ZAP Express"^"^ vector) to amplify dehalogenase sequences from the singly- 
expanded genomic DNA bacteriophage library. Amplification was accomplished using the 
polymerase chain reaction (50 |iL) containing 1 juM of RDhI 5.4 primer, lOOnM biotinylated 
T3 Pro primer (New England Biolabs), lOx Amplitaq reaction buffer (Perkin-Elmer-Cetus), 

10 1.5 mM MgClj, 5U of rAmpliTaq DNA polymerase (Perkin-Elmer-Cetus), and 4 jiL of the 
phage library (whole phage). Amplification was for 35 cycles of the following thermal profile: 
1 min. @94°C; 2 min, @ 55°C; 2 min. @72°C. PGR products were separated by 
electrophoresis through 1.0% agarose and a discrete band of 1.3 kbp was identified, excised 
from the gel, and isolated using a QiaQuick gel purification kit (Qiagen, Inc.). After 

15 confirming that this DNA was also capable of being amplified by other Rhodococcus 

dehalogenase-specific primers, the fragment was digested with restriction enzymes Nco I 
and Pst\ and ligated into Nco \/Pst\ digested pGEM52f(+) (ProMega, Madison. Wl). 
Sequencing of the 3'-untranslated region of the cloned segment allowed identification of a 
putative stop codon and subsequent amplification of the coding region with primers RDhI 5.4 

20 and RDhI 3.12. 

Sequence and Restriction Enzvme Analyses . Double stranded sequencing of the 
dehalogenase gene proceeded via successive rounds of the dideoxy method with the 
biotinylated primers (Table 3) designed for each successive round, based on the sequencing 
results in preceding rounds. Bands were separated on 5.5-6.0% polyacrylamide urea 
25 sequencing gels, the DNA transferred to nitrocellulose filters by capillary transfer and 

visualized using the well-known streptavidin-alkaline phosphatase development protocols in 
combination with chemiluminescent substrates. 
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Table 3: Sequences and Orientation of OliaonucleGtidePiic nn-ers'lJsBdiin Sequencing the 
Rhodococcus Dehaloaenase Gene 



Oligo Name 


Orientation 


Specific for bp 


Sequarace* 


Dhl Seq 7 


Forward 


697-714 


5'Bio-CCTGTCCCSAAGTTGTTG 


Dhl Seq 8 


Reverse 


807-791 


5'Bio-CGGQCCGATGTCCACTG 


Dhl Seq 1 1 


Fo-ward 


186-202 


5'Bio-TGCTCCAGACCTGATCG 


Dhl Seq 12 


Reverse 


496-480 


5'Bio-TCTGATCGATGATCAAC 


Dhl Seq 13 


Forward 


404-422 


5'Bio-TCCCGACGTGGACGAATG 


Dhl Seq 14 


Reverse 


663-646 


5'Bio-GAGCGCGACGATGTTCGC 


Dhl Seq 15 


FonA^ard 


725-742 


5'Bio-CACCCGGCGTACTGATCC 


Dhl Seq 18 


Reverse 


951-934 


5'Bio-GAGACCGGTCAGCATTCC 


PR0K-SEQ1 


Forward 


PROMOTER 


5'Bio-GAGCGGATAACAATTTGA 


PR0K-SEQ2 


Reverse 


TERMINATOR 


5'Bio-TCTCATCCGCCAAAACAG 



biotinylated primers already described in Table 2 which also were used to determine the sequence of 
the gene. Commercially available (New England Biolabs) biotinylated primers specific for the T3. 17, 
and SP6 promoters were also used but are not listed here. 

The vector pEXPROK (Figure 1) is a derivative of the commercially available 
pPROK-1 vector (Clontech, Inc.. Mountain View, CA). Whereas the pEXPROK retains the 
functional elements of the pPROK vector (including the ampicillin resistance marker, the 
Pfac transcriptional promoter, and paired transcription termination signals following the 
polylinker), pEXPROK vector replaces the EcoR l-to- Hind 111 polylinker of pPROK-1 with an 
extended synthetic polylinker referred to as EXFLAG. The EXFLAG linker is designed to 
allow insertion of an open reading frame between an Nco I site and an Nhe \ site. In-frame 
with the six-nucleotlde Nhe I site is an 11 amino acid peptide, the final octapeptide of which 
corresponds to the well-known FLAG peptide (Kodak Imaging Systems, Rochester, NY) to 
which antibodies and affinity reagents are commercially available. Sequence and features of 
the EXFLAG linker are as follows: 
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EcoRI Nco l Hind III Xba I Xho I Nhe I I EXFLAG- 

nAATrCAG CCATGGCA TAAGCTT TCTAGA CTCGAG GGA GCTAGC GGC CTA GGT 

Gly Leu Giy 



peptide -— Not I 

GAG TAG A AG GAG GAT GAT GAG AAA TAA TG A GCGGCCGC TAGCTT 
AspTyr Lys Asp Asp Asp Asp Lys *** *** 

PGR amplification of the RDhI 5.4/T3Pro gene from the pGEM5 construct with 
primers RDhI 5.4 and RDhI 3.12, followed by digestion with Nco I and Nhe I, allows ligation 
of the Rhodococcus dehalogenase gene into the appropriately digested expression vector. 
This procedure was used to insert the RDhI gene into pEXPROK. The plasmid maps of 
pEXPROK and pEXPROK-RDhI are shown in Figures 1 and 3, respectively. The DNA 
sequence of the pEXPROK-RDhI construct was later confirmed by automated DNA 
sequencing. 

Sequence Analysis . The complete DNA and derived protein sequences for the 
dehalogenase gene are shown in Figure 2. DNA Sequence data reveals an open reading 
frame of 876 bp, giving a deduced protein sequence of 292 amino acids and a predicted 
molecular weight of 33kD. This is similar to the molecular weight reported for a number of 
other hydrolytic dehalogenases. 

To determine whether the isolated gene is likely to encode a dehalogenating enzyme, 
a MacVector v.4.5.2 (Kodak, Inc.) sequence analysis package was used to compare the 
derived protein sequence with those of all other known proteins contained in the Entrez 
Sequence Database (the Entrez Database is maintained by the National Center for 
Biotechnology Information). The RDhI polypeptide displays the greatest similarity to 
members of the so-called a/p hydrolase family of enzymes including several haloalkane and 
haloacid dehalogenases, epoxide hydroalses, and enzymes with a number of diverse 
catalytic functions. Alignment of the Dow Rhodococcus dehalogenase with two other 
dehalogenases and a non-dehalogenase (luciferin monooxygenase) enzyme is shown in 
Figure 4. The enzymes included in the figure and their publication references are as follows: 
Xanthobacter autotrophicus haloalkane dehalogenase - D.B. Janssen, ef a/., J. Bacteriology 
171:6791-6799 (1989); tetrachlorocyclonexadiene hydrolase (TCCH or LinB) - Y. Nagata, et 
ai, J. Sacfer/o/ogy 175(20):6403-6410 (1993); Renilla reniformis luciferin monooxygenase - 
W.W. Lorenz, etaL Proc. National Acad, of Sciences, U.SA, 88(10):4438-4442 (1992). 
More recent releases of the Entrez database also reveal significant similarity between the 
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Dow RDhI protein and two hypothetical mycobacterium tuberculosis proteins of unknown 
function (Entrez Database Accession numbers 1449324 and 1478233, submitted 7-22-96 
and 7-23-96, respectively, by K. Badcock and CM. Churcher, ef a/.), as well as with the 
haloalkane dehalogenase isolated from Rhodococcus rhodochrous (Entrez Database 
5 Accession number 1196824, submitted 2-15-96 by A.N. Kulakova, etaL). 

Of me sequences aligned in Figure 4, only the Xanthobacter 6eha\ogenase has been 
well characterized at a structural and mechanistic level. Notably, two of the three residues 
known to be involved directly in the Xanfhodac/er catalytic cycle (the two most important 
residues, Asp-124 and His-289) are conserved in the Rhodococcus sequence. These 
10 similarities and those indicated in the Figure suggest a high degree of structural and 
mechanistic conserv'ation among members of this family of proteins. 

Pf ^h^'QQenase Protein Exoression . 

To confirm the identity of the above, cloned enzyme as a dehalogenase, we sought 
to express the full-length protein in E, coli. To accomplish this, a 1300 bp Nco MSpe I 

15 restriction fragment, containing the RDhI gene was excised from the pRDhlK02.1-pGEM5 
construct and ligated with the Nco MNhe l-digested pEXPROK vector. Because Spe I and 
Nhe 1 generate ligation-compatibie restriction fragments, this resulted in the generation of an 
expression construct (Figure 5) containing the complete putative RDhI gene under the 
transcriptional control of the IPTG-inducible Pfac promoter and the termination control of the 

20 endogenous RDhI 3' untranslated region. 

Colonies transformed with the resulting piasmid (pRDhlK02.3-EXPROK) were grown 
overnight in 2 mL minicultures, following which 1 mL of each culture was pelleted, washed, 
and sonicated. Extracts were then assayed for their capacity to catalyze release of chloride 
following addition of the RDhI substrate, 1-chlorobutane. Chloride releasing activity was 

25 absent from cultures not containing the cloned gene; cultures with the cloned gene exhibited 
chloride releasing activity which increased when transcriptional activity of the gene was 
increased by the addition of IPTG. Thus, dehalogenase activity could be induced in 
overnight cultures of the recombinant E. co// containing the pRDhlK02.3-pEXPROK 
construct. 

30 

Example 2 

The gene encoding this dehalogenating enzyme has been isolated and cloned into 

the bacterium, E. co\i. DNA sequence analysis revealed that this isolated gene encodes a 
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protein with a high degree of sequence similarity to other known dehalogenating enzymes. 
In an effort to increase levels of biosynthesis to commercially meaningful levels (i.e. 
"expression"), a number of systems reported to enable high level production of heterologous 
proteins in Ecoli^exe examined. 

5 To generate the expression vector pEXPROK-RDhI, plasmid pEXPROK was digested 

with restriction enzymes Nco \INhe I and then purified by a QIAquick Gel Extraction Kit 
(Qiagen, Inc., Chatsworth, CA). The RDhI open reading frame was amplified with primer 
RDHL 5.4 as the fonward primer (containing an Nco I site to direct the start of translation) 
and primer RDHL 3.12 as the reverse primer (and containing an Nhe I site). Following 
10 digestion of the amplified DNA with Nco I and Nhe I, the gene was ligated into the pEXPROK 
vector The new construct was then transformed into E. coliAGI competent cells and 
ampicillin resistant colonies were picked. Plasmids containing the RDhI gene were identified 
by analytical restriction enzyme digestion and referred to as pEXPROK-RDhI construct. The 
pEXPROK-RDhI plasmid map is shown in Figure 3. 

15 Construction of pRSET-RDhI and pTrcHis-RDhl Expression Vectors 

For construction of both pRSET-RDhI and pTrcHis-RDhI expression vectors, the RDhI 
gene was amplified from the pEXPROK-RDhI using oligonucleotide primers RDhI 5.4 and 
RDhI 3.13 using standard PGR conditions. Amplification products were separated on 
agarose gels and purified using standard procedures. 
20 Both pRSET and pTrcHis vectors are IPTG inducible expression vectors, derived 

from the pUC 18 and 19 series of cloning vectors. They both were purchased from 
Invitrogen Corp. (San Diego. CA) and contain the following features: 

(a) Both are designed for high level protein expression and both carry an 
ampicillin resistance gene. 
25 (b) Both contain a sequence that encodes an N-terminal fusion peptide which 

codes for six histidine residues. These residues function as a metal binding domain and 
may allow later purification of recombinant protein by affinity chromatography. 

(c) The vectors encode an enterokinase cleavage recognition sequence (the 
FLAG and/or EXFLAG peptide) downstream of the dehalogenase coding region which allows 
30 detection by and immobilization upon anti-FLAG antibodies. 

The high level expression property of the pRSET vector results from the presence of 
the T7 promoter upstream of the heterologous gene. Since E. co// does not contain the T7 
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polymerase, however, an M1 3. phage containing the T7 RNA polymerase gene is needed for 
protein expression. In practice, bacteria containing a heterologous gene under the control of 
a T7 promoter are induced to produce the heterologous protein by infection of recombinant 
E CO// with T7 phage containing the 17 RNA polymerase. Alternatively, commercially 
5 available E. coli stably expressing the T7 RNA polymerase enzyme (i.e. BL21) can be 
transformed with the pRSET construct. 

The pRSET-RDhI expression vector was generated by digesting plasmid pRSET with 
restriction enzymes Nco MHind III and then incorporating an RDhI gene fragment which 
contains an Nco I site at the 5' end and a Hirvi III site at the 3' end. The new construct was 
10 then transfected into £ co// JM109 competent cells and ampiciliin resistant colonies were 
picked. Plasmids containing the RDhI gene were identified by analytical restriction enzyme 
digestion and referred to as the pRSET-RDhI construct. The pRSET-RDhI expression 
construct is shown in Figure 6. One such clone (Clone 16-4) was used to characterize 
protein expression using the pRSET system, 

15 The pTrcHis vector contains another high level transcriptional promoter - the trc 

promoter, a fusion of the well-characterized trp promoter and the lac promoter. The pTrcHis 
vector also contains a mini-cistron upstream of the heterologous gene which provides highly 
efficient, repeat initiation of translation of the cloned protein in the multiple cloning site. 

Using a similar process, we cloned an RDhI gene fragment into the pTrcHis vector to 
20 generate the pTrcHis-RDhI expression vector. For expression studies, E. co// TOP 10' 
competent cells were transformed with the pTrcHis new construct. Both the pRSET-RDhI 
and pTrcHis-RDhI expression vectors contain an 1 1 amino acid EXFLAG peptide 
downstream of the Nhe I site. 

The EXFLAG peptide sequence is in-frame with the open reading frame of the cloned 
25 protein and is useful for analytical detection and affinity purification. Figure 7 shows a map 
of the completed pTrcHis-RDhI expression construct. One such clone (Clone 18-3) was 
identified as a high expressing clone and used for further characterization of the TrcHis 
expression system. 

Construction of pTrxFus-RDhl Expression Vector 

30 The ThioFusion"'' expression system (Invitrogen Corp., San Diego, CA) provides a 

means of expressing large amounts of heterologous protein by fusing the gene encoding 
such a protein to the gene encoding the E co// proiein, thioredoxin, in the pTrxFus 
expression vector. The thioredoxin moiety can confer both solubility and heat stability to its 
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fusion partner, thereby opening up new options for purification by osmotic shock or heat 
treatment. The expression vector, pTrxFus, allows foreign genes to be inserted into its 
multiple cloning site. It uses the promoter from bacteriophage lambda to drive expression 
and the cl repressor, also from bacteriophage lambda, to control the level of transcription. 
5 Expression of the cl repressor gene is under control of the trp promoter and repressor. 
Expression of a foreign gene is induced by adding tryptophan to the medium which shuts 
down cl repressor synthesis and allows transcription from the promoter. 

Primers Trx2++ and Trx- (see DNA Primer Design) were designed to modify the RDhI 
gene fragment with an enzyme restriction site unique to the TrxFus multiple cloning site. 

10 Plasmid pEXPROK-RDhI was used as a template, and a gene fragment was generated by 
PGR, using primers Trx2++ and Trx-, which added a BamH I site to the 5' end and a Pst I 
site to the 3' end. The fragment was purified using a QIAquick PGR Purification Kit. Both the 
pTrxFus vector and the gene fragment were enzyme-digested, agarose gel purified, and 
ligated. The new construct, pTrxFus-RDhI (Figure 8), was incorporated into GI174 

15 electrocompetent cells (Invitrogen Gorp.) which had been prepared following the 
manufacturer's instructions. 

Expression Analvsis 

Growth and Induction of Cell Cultures-Far expression studies, clones identified as 
containing proper DNA constructs were cultured in 3 mL of Luria Broth (LB) or SOB medium 

20 (Difco, Detroit, Ml, USA) containing 50 |ig/mL ampicillin in 15 mL round-bottom 

polypropylene culture tubes. These culture tubes were incubated overnight at 37°C with 
shaking (200 cycles/minute in a rotary shaker) or grown to an ODg^o of 0.6. Afterward, 2 mL 
of fresh medium with IPTG was added (to a final iPTG concentration of 1mM) and the tubes 
were incubated at 37°G with constant shaking for another 4-5 hours. For recombinant 

25 clones of pRSET, after 1 hour of IPTG induction the cell cultures were infected with 
previously titered M13/T7 phage and the incubation continued as described previously. 

For recombinant clones of pTrxFus, RDhI gene-containing clones were cultured in 1 
mL RM medium with 100 |ig/mL ampicillin and incubated overnight at 30°G with shaking 
(200 cycles/minute in a rotary shaker). The next day, 9 mL fresh Induction Medium were 
30 added and growth continued at 30°G to an OD^^q of 0.5. Then, ceil cultures were induced 
with tryptophan (to a final concentration of 100 ^g/mL) and transferred to a 37°G incubator 
and shaken at 200 rpm for another 2 to 4 hours. 



30 



wo 98/36080 



PCTAJS98/02776 



Cell Free Extract Preparation-For protein analysis, induced, overnight cell cultures 
were pelleted by centrifugation at 4''C (5000 rpm for 10 minutes in a Sorvall SS-34 rotor). 
Cell pellets were washed in cold 10 mM Tris sulfate buffer (pH 7.5) containing 1mM disodium 
EDTA and then centrifuged again. For clones of pEXPROK, pRSET. and pTrcHis, final 
suspensions were sonicated at 14 Hz on ice through 3 repetitions of a 20 seconds "on", 30 
seconds "off" regimen, using a small-tip sonic^^ing probe (Soniprep 150, MSE Ltd., Crawley, 
Sussex^ Insoluble debris was removed by centrifugation at 10,000 rpm for 10 minutes. 
Cell-free supernatants were then transferred to clean polypropylene tubes and appropriate 
assays performed. Final cell suspensions from clones of pTrxFus were sonicated for three 
10-second bursts and then flash-frozen in a dry ice/ethanol bath. Shortly after freezing, the 
cell lysates were quickly thawed at and two more, rapid sonication-freeze-thaw cycles 
were performed. After the last thaw, the procedures described above for removing the 
cellular and insoluble debris were continued. 

Expression and Purification of pEXPROK-RDhI 

Figure 9 shows a Pro-Blue™ stained SDS-PAGE gel of cell lysate samples of the 
pEXPROK-RDhI clone 12-4 on the left side (lanes 2-5) and partially purified rRDhI enzyme 
on the right side (lanes 8-11). Lane 1 contains molecular weight standards and lanes 6 and 
7 contain single, 60 ng and 180 ng bands of the FLAG-peptide protein at a molecular weight 
of 55kD. Lanes 2-5 show all the soluble protein from cell-free extracts. Since rRDhI enzyme 
is not a major protein in the extracts, immunoblotting of an identical gel was done to confirm 
the presence of this recombinant enzyme. Figure 10 shows this recombinant enzyme band 
in each sample lane, as recognized by an anti-FLAG antibody at the predicted molecular 
weight of - 35kD. Lanes 6 and 7 in Figure 10 are 20 ng and 60 ng, respectively, of the 
FLAG-peptide protein. Affinity purified recombinant enzyme was analyzed on both a Pro- 
Blue^-stained SDS-PAGE get and an immunoblotting membrane. Four consecutive 
fractions of affinity-purified rRDhI enzyme were run in lanes 8-1 1 of the Pro-Blue^^-stained 
SDS-PAGE gel shown in Figure 10. In addition to a prominent band at -35kD molecular 
weight, other protein bands are visible on the gel. The immunoblot of the partially purified 
enzymes (Figure 10, lanes 8-11), however, confirms that the recombinant enzyme at -35kD 
is the only protein to stain with anti-FLAG antibodies and thus appears to be the proper 
translated rRDhI protein. This data suggests that rRDhI enzyme is stable both in the E. coli 
intracellular environment and throughout the purification process. 
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Fxpression of pRSET-RDhl and pTrcHis-RDhI 

Cell free extracts obtained from clones of the pRSET recombinant enzyme 
expression system and the pTrcHis recombinant enzyme expression system were analyzed 
for the presence of recombinant RDhI protein. Five clones containing the correct size 
5 Nco UHind III DNA fragment were identified, cultured overnight, lysed, and analyzed for 
rRDhI expression by SDS-PAGE gel (Figure 1 1). Lane 1 shows molecular weight standards 
and lane 7 and 8 contain single 60 ng & 180 ng bands of the FLAG-peptide protein at a 
molecular weight of 55kD. Lanes 2-6 show samples of 1 jiL of cell-free extracts from the 5 
clones and lanes 9-12 show samples of 0.1 |iL of the cell-free extracts. Immunoblots of 

10 these extracts reveal doublet bands (35kD and 38kD) in each sample lane, when the anti- 
FLAG antibody is used to stain the immunoblots (Figure 12). This may suggest that there 
are two start codons in the pRSET-RDhI system. The first start codon was originally 
designed in the pRSET vector system to be about 41 amino acids (123 bp) before the actual 
cloning site, which allows the initiation of translation from that Met ATG codon followed by 6 

15 histidines. The second start codon may occur at the Nco I cloning site itself, which was 
designed into the original 5' end primer of the RDhi gene fragment. However, the presence 
of an anti-FLAG antibody-reactive band confirms the presence of rRDhI enzyme. 

Figure 13 shows the Pro-Blue^^-stained SDS-PAGE gel with cell-free extracts from 
the pTrcHis system and Figure 14 shows the immunoblot of an identical SDS-PAGE gel. All 

20 clones of the pTrcHis system show an overloaded, anti-FLAG-reacted band at a molecular 
weight of -35 kD, which confirmed the presence of rRDhI enzyme in the extracts. Since the 
volumes of the initial culture and the cell-free extract preparations are the same in all three 
systems, these overloaded bands are an indication of higher enzyme production in the 
pTrcHis system. 

25 Expression of pTrxFus-RDhl 

The soluble protein, cell-lree extracts from the pTrxFus system were examined for 
the presence of rRDhI enzyme, using reducing SDS-PAGE. Figure 15 shows a gel stained 
with Pro-Btue^^ Lane 12 shows molecular weight standards and lane 1 1 shows a single 
ISOng band of the FLAG-peptide protein at a molecular weight of 55kD. The thioredoxin 

30 fusion bands are clearly visible as the major protein in lanes 1 to 9 at a molecular weight of 
47 kD. This size corresponds to the 12 kD of the thioredoxin protein and 35 kD of the rRDhI 
enzyme. In contrast, lane 10 has a sample of insoluble matter from cell lysis, which shows 
no presence of the high-level, expressed thioredoxin fusion protein. This demonstrates that 



wo 98/36080 PCT/US98/02776 

all of the fusion protein is in a. soluble state. Data from other experiments indicate this fusion 
protein band can be recognized by anti-FLAG antibody at the same molecular weight in the 
immunoblot membrane (data not shown). 

Analysis of Hvdrolvtic Dehaiooenatio n Activity 

To quantify the hydrolytic dechlorination activity of the recombinant enzyme, a 
colorimetric chloride-release assay at 460 nm was used. 

Recombinant protein activity was measured by adding an appropriate amount of cell- 
free extract (prepared as described above) to 6.0 mL of reaction buffer in a glass vial. 100 
mM sodium glycinate buffer (pH 9.0) was used for measuring activity toward 1,4- 
dichlorobutane (DCB) (Aldrich Chemical Co.). and 100 mM Tris-SO, buffer (pH 7.0) was 
used for measuring activity toward 1 ,2,3-trichloropropane (TCP). The halogenated substrate 
(6|iL) and a micro stir bar were added and the vial was capped. Capped vials were incubated 
in a aO'^C water bath with stirring. 

Periodically, 1 .0 mL samples were removed and assayed for free chloride. Reagent 
1, 0.375M Ferric Nitrate in 5.25 N Perchloric acid (10% v/v), was added to stop the hydrolytic 
reaction and reagent 2, saturated Mercuric Thiocyanate in ethanol (10% v/v), was added to 
develop color. Final samples were read in a Perkin-Elmer 552A UVA/IS Spectrophotometer 
at 460 nm. Rates were determined after correcting for non-enzymatic hydrolysis against a 
blank. 

DehalooenatinQ Activity of Recombinant Rho dococcus dehalooenase 

While the preceding data suggest that the recombinant dehalogenase can be 
synthesized at much higher levels in E. co// than in wild type Rhodococcus, they do not 
address the activity of the expressed protein. Indeed, enhancing production of a 
dechlorinating enzyme is the key objective of this work. For this reason, we examined the 
relative levels of dehalogenase activity in representative clones from each of the above 
constructs. Activity was determined by the free chloride release assay and compared with 
protein expression as documented in Figures 13-15. Protein expression was quantified on 
SDS-PAGE gels by high resolution scanning densitometry and the measured amount of 
rRDhI was stated in terms of % of total soluble protein. The following table shows the 
relationship between dehalogenating activity and the percent of rRDhI enzyme in the total 
soluble protein among all four expression systems. 
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Expression 
oysicrii 


% OT 1 otal 

Qnli ihlp Prntoin 


DOd Activity per 

ml of 0\ ilti irp 

MIL- \Jt WUIIUIC 


Clone Name 


pEXPROK 


~3 


0.3 X 10' 


EXPROK-RDhI 


pRSET 


-10 


0.8 X 10 ' 


RSET RDhI Clone 16-4 


pTrcHis 


-15 


2.4x10' 


TrcHis RDhI Clone 18-3 


pTrxFus 


-30 


4.8 X 10 ' 


TrxFus RDhI Clone 4 



* DCB units were measured as the degree of indicator color change (AOD/min.) as the 
enzyme dechlorinated 1 ,4-dlchiorobutane. 



This data reveals a strong correlation between level of rRDhI protein expression and 
5 observed dehalogenase activity. 

In this example, Rhodococcus haloalkane dehalogenase can be expressed at high 
levels in E. Co// in 3 of the 4 systems examined. The recombinant Rhodococcus 
dehalogenase is stably expressed in all four systems and recognized by anti-FLAG 
antibodies at the expected molecular weight. This recombinant enzyme exhibits a 
10 dehalogenase activity at a level similar to that of the wild type and proportional to the level 
of heterologous protein expression. 

Example 3. 

Preparation of Porous Alumina Supports 

15 Protein (385 mg) representing 22 TCP units of activity were immobilized on 2.0 g of 

volatile-free alumina (lot #1587 of k-4 alumina from UOP of DesPlaines, IL, USA) at 4°C with 
mild agitation over the weekend. The procedure followed UOP's standard practice of GIA- 
activation of the polyethyleneimine coating, water washing, and enzyme addition. The 
bathing solution was decanted and the support was washed five times with 2 mM Tris/1 mM 

20 EDTA, pH 7.5. 

Enzyme Purification and Preparation for Immobilization 

Recombinant /?/7odococcLys dehalogenase was produced in E. co// using the pTrcHls 
expression system. Enzyme preparations used for all immobilization studies were first 
partially purified using ammonium sulfate precipitation, using a cut of 45 to 70% saturation at 
25 4 °C, followed by dialysis and clarification in 10 mM Tris sulfate, 1 mM EDTA, pH 7.5. This 
basic buffer was used throughout all purification steps. These preparations were routinely 
4.5-fold purified from the lysate, as determined by absorbance at 280 nm, and were 
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estimated to be 30-35% pure dehalogenase protein by SDS-PAGE. More highly purified 
enzyme preparations were achieved by an additional DEAE-Sepharose chromatographic 
step of eluting with a 0-400 mM ammonium sulfate gradient. This provided about 10-fold 
purification from lysate, with 85-90% enzyme pLrity. This step was followed by QAE- 
5 Sepharose FF chromatography with a narrower 0-1 20 mM ammonium sulfate gradient, 
achieving about 12-fold purification from lysate. and SDS-PAGE which demonstrated 
enzyme homogeneity. Purified RDhI from the TrcHis RDhI expression system is typically 
referred to herein as "rRDhl." 

Preparation of Supports 

10 All anion exchange supports were thoroughly hydrated according to manufacturer's 

instructions (if necessary), then exhaustively washed to remove ethanol and to exchange 
into the sulfate form by continuous rinsing withIO mM Tris sulfate, 1 mM EDTA. This same 
starting buffer was used throughout to load enzyme preparations. 

Inorganic supports were modified with polyethyleneimine and glutaraldehyde 
15 according to well established protocols (U.S. Patent No. 4,268,410 and Mosbach, 

lmmnbili7Gd Enzymes, in 44 Methods in Enzvmologv . (1976) (Academic Press» NY)). Dry 
samples of supports were weighed out and distributed into 12 mL capped vials. An aqueous 
solution of 2.5 % polyethyleneimine was added to a total of 1 0 mL per gram of support. 
Vials were capped, and then agitated gently on a rocking shaker for 1 hour at room 
20 temperature. Samples were transferred to a small Buchner funnel where liquid was 

removed by gentle vacuum. Supports were transferred to a watch glass and allowed to air 
dry at room temperature overnight (about 18 hr). Samples were transferred to a new vial to 
which was added a freshly thawed solution of 25% aqueous glutaraldehyde at a ratio of 20 
mL per gm of support. The mixture was capped and shaken intermittently for 1 hour in a 
25 hood. The glutaraldehyde was removed by decantation and washed exhaustively with water 
until no aldehyde was detected by a fuchsin test. Prior to enzyme immobilization, supports 
were decanted, but not dried. 

Immobilization of Enzvme 

All enzyme immobilization was performed in a cold room at 4°C using a rocking 
30 shaker to provide gentle agitation. Times used for binding of enzyme preparations to 
supports ranged from 1 hour for the ion-exchange supports to a maximum of 4 days for 
experiments with the PEI-GIA modified inorganic supports. Buffer exchange was used only 
for the Celite R 648 binding capacity studies in which the Tris buffer was exchanged for a 10 
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mM potassium phosphate, 1 mM EDTA, pH 7.0 buffer on a pre-packed Sephadex G-25 
column. 

Enzyme Stripping from lon-exchanoe Supports 

Two sets of 250 |jL aliquots of resin slurry were bound overnight with either 1 1 .1 DCB 
5 U (Toyopearl®) or 6.63 DCB U (PEI Cellulose) of enzyme. The binding supernatants were 
carefully removed and assayed. Resins were spun at 6,000 rpm for 8 minutes. Additional 
supernatant was removed and the resins were washed twice with 100 mM Na Giycinate 
buffer (pH 9). One tube from each set was treated with 0.5 M (NHJ^SO, for 1 hour to strip 
the enzyme from the support. These resins were rinsed again with buffer. Resins and 
10 supernatants were assayed for activity using DCB as a substrate. 

Ion-Exchange Supports 

Anion-exchange chromatography has been used extensively in the purification and 
characterization of both the wild type and the recombinant dehalogenase enzymes. The 
enzyme Is anionic at neutral pH (where the dehalogenation reaction is performed) and anion 

15 exchange supports often function well in immobilizing such proteins. This approach was 
also attractive because it allowed for simultaneous purification and immobilization of the 
enzyme. To confirm this potential utility, we examined binding and elution of the rRDhI 
protein to anion and cation exchange resins over a wide pH range. Figure 2 shows the 
nearly quantitative retention of the dehalogenase on DEAE Sepharose anion exchange resin 

20 over a range of 5 pH units. For contrast, the CM-Sepharose rapidly loses its binding 
capacity above pH 5. 

Immobilizations 

A number of support materials were examined for their efficacy in immobilizing rRDhi. 
In these studies, a 40-70% ammonium sulfate cut of the dehalogenase enzyme was used. 

25 Duplicate sets of enzyme, immobilized on each of thirteen ion-exchange supports were 
prepared. One of each set was assayed immediately for dehalogenase activity using the 
chloride release assay. The second was treated with TCP-saturated 10 mM Tris sulfate (pH 
7.5) at AS^'C for 1 hour. Following this treatment, supernatant was removed and this set was 
also assayed by the standard chloride method. Table 4 summarizes the results of these 

30 assays. 
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Table 4 Screen of TrcHis RDhI on 13 ion-exchange supports following incubation at 

AS'^C for 1 hour in the presence of substrate. 



Support 


Supplier 


Lot/Batch # 


% Activity 

flftor TPP 

Treatment 


Silica Gel PEI-Silica 


Sigma 


24H0810 


0 


DEAE Sephadex A-50 


Sigma 


24H0485 


19 


PEI Cellulose(med. mesh) 


Sigma 


94H7200 


54 


Glass, Aminopropyl 


Sigma 


34H8260 


43 


Toyopearl® Super Q-650M 


TosoHaas 


65QAM02RM 


79 


DEAETrisacryl Plus-M 


Sigma 


92H0861 


21 


Spectra/Gel Ion Exchange 1X8 


Spectrum 


16865 


14 


Dowex® 1X8-200 Ion Exchange Resin 


Aldrich 


12627-85-9 


54* 


DE52 


Whatman 


1152032 


50 


Quaternaryammonium Cellulose 


Whatman 


9852032 


2 


DEAE Sepharose 


Sigma 


53H0177 


30 


AG 3X4 100-200 


Bio-Rad 


52594A 


18 


AG 4X4 100-200 


Bio-Rad 


47426A 


9 



Incubated at 37°C 

These results indicate that there is marked heterogeneity in the efficacy of these 
matrices as supports for the dehalogenase enzymes. Similar heterogeneity will be seen for 
similar dehalogenases catalyzing similar reactions. 

Four of the best candidates were screened for stability over time in the presence of 
TCP. These were: PEI cellulose, Toyopearl® Super Q-650M, Glass Aminopropyl, and 
DEAE Sepharose. Duplicate sets were prepared, one to be used for an initial chloride 
detection assay, the second for assay after exposure to TCP. Table 5 shows that 3 of the 4 
lost significant activity in the first 24 hours but retained stable activity out to at least 7 days 
following the initial loss. Toyopearl® undenwent a similar but delayed loss at 48-120 hours 
and then appeared to stabilize. 
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Table 5: Stability Study of TrcHis RDHL on 4 lon-Exchanoe Supports in 
Presence of TCP at Room Temperature 



Support % Activity Over Time 





24 hr 


48 hr 


120 hr 


192 hr 


PEI Cellulose(med. mesh) Sigma 


83 


79 


78 


78 


Glass, Aminopropyl Sigma 


78 


87 


75 


66 


Toyopeari® Super Q-650M TosoHaas 


100 


100 


78 


82 


DEAE Sepharose Sigma 


76 


79 


73 


62 



5 

All four resins appear to be good candidates for immobilizing the dehalogenase and 
appear to provide a suitable surface for prolonged enzyme activity. 

Covalent Coupling to Tresyl-Activated Polyacrylic Polymer 

In order to determine the feasibility of covalentiy coupling rRDhI through pendant 
10 amino groups to any support, an activated resin, Tresyl-Toyopearl, was evaluated. This 
activated resin provides a stable secondary amino group linkage with the enzyme; 

o 
II 

)-0-R-0-CH2-CH20SOCH2CF3 + H.N-Enzyme )-0-R-0-CHrCH2-NH-En2yme 

15 

The rRDhi preparation used for these studies had been affinity purified from E. coli 
lysate using the anti-FLAG antibody column, and was estimated to be about 20% pure by 
SDS-PAGE. 0.35 units of enzyme (1 .96 mg total protein) were coupled to 40 mg of the 
Tresyl-Toyopearl under conditions described by the manufacturer. After 3 hours, 93% of 

20 protein had been coupled as determined by the decrease observed at A^^^. An additional 10 
mg of resin were added and coupling continued for 1 hour to bind >98% of the protein. Re- 
assay of the washed gel for dehalogenase activity revealed recovery of 0.11 units of activity 
(31%). A second trial using 2.6 units of the same enzyme preparation and 1 .0 gm of 
activated resin demonstrated a recovery of 37% activity. According to manufacturer's notes, 

25 recoveries of activity from enzymes coupled to this support usually lie in the 40-60% range, 
so 31-37% represents reasonable recovery and is sufficient to make commercially practical 
the coupling of the enzyme, via its amino groups, to an immobilization support material for 
use in a bench-size or industrial bioreactor. 
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This covalent attachment to hydrophilic resins is also an effective means of 
immobilizing the dehalogenase enzyme. 

Polvethvleneimine Impregnated Inorganic Supports Cross-Linked with Glutaraldehyde 

Inorganic supports have also found wide utility in the industrial enzyme arena due to 
5 availability, low cost, high loading capacity, ease of regeneration and reuse, and the wide 
range of pore sizes. Porous alumina, silica, and Celite have found widespread use as 
supports for immobilized enzymes, with titanium- and carbon-based supports seeing more 
limited application. 

Enzymes can be immobilized to inorganic supports by three mechanisms. The 
10 enzyme may associate with the inorganic support through ionic interactions or may bind 
through an ion-exchange mechanism to an ionic polymer which has been impregnated into 
the inorganic support, or be crosslinked to the ionic polymer using a bifunctional chemical 
linker. The first approach has not seen wide applicability because the weak ionic 
interactions frequently lead to enzyme leaching. Poiyethyleneimine (PEI) is the polymer of 
15 choice for impregnation because of low cost. The amino groups allow a wide range of 
crosslinking chemistry to be applied. Glutaraldehyde is by far the most studied and 
inexpensive crosslinking agent used. Studies with rRDhi focused entirely on this coupling 
chemistry. 

Established methods for the preparation of PEI-impregnated porous supports, 
20 followed by glutaraldehyde crosslinking were used (U.S. Patent No. 4,268,410 and Mosbach, 
immnhiiiTRri FnyymBs. in 44 Methods in Enzvmoloov . (1976) (Academic Press, NY)). The 
recovery of rRDhI activity was initially screened for two supports. Porous silica already 
impregnated with PEI was obtained from Sigma (nominal pore size of 250 A). Celite R-648 
was obtained from Manville (nominal pore size of about 150 A) and impregnated with PEI 
25 (avg. 50,000 MW) from Sigma according to the methods of U.S. Patent No. 4,268,410. Both 
supports were treated with glutaraldehyde (GIA) and then washed exhaustively with water. 
5.5 units (1 .0 mL at 0.55 mg protein/mL) of a highly purified rRDhI enzyme preparation 
(>98% pure by SDS-PAGE) was used to couple to 500 mg each of the two glutaraldehyde- 
treated, PEi-impregnated supports. This loading level (0.11% w/w) was assumed to be at 
30 least two orders of magnitude below the known loading capacity of the supports. The 
enzyme was incubated with the supports, with gentle shaking overnight (18 hr) at 4 ""C, 
before washing exhaustively to remove unlinked protein. Re-assay of the two supports with 
DCB demonstrated recoveries of 40% for the PEI-Celite R-648 and 31% for the PEI-Silica. 
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These samples were stored at room temperature under reaction conditions (saturating DCB) 
for 1 week and re-assayed. The Celite immobilized enzyme preparation lost 49% of activity 
in a week while the Silica immobilized enzyme preparation lost 28% of its activity. 

In order to quickly determine which type of inorganic support would provide the best 
recovery of rRDhI activity, several commercially available porous supports were screened. 
As in the previous experiment, loading levels of the highly purified rRDhI enzyme were set 
greater than three orders of magnitude (0.0055 % w/w) below the expected loading 
capacities of the supports in order to compare the supports on the sole basis of activity 
recovered, independent of loading capacity, pore size and so forth. 

Three porous aluminas, three porous silicas, and two porous carbons were screened. 
Additionally, Sigma PEI-Silica and Celite R-648 (evaluated in the previous screen) were 
reevaluated under the same conditions. All supports were impregnated with PEI and treated 
with glutaraldehyde as before. 25 |iL of enzyme preparation (13.8 ^g) were incubated with 
each support for 72 hours at 4 ''C with gentle agitation to ensure maximum loading. Enzyme 
loading in the bathing solution was monitored by measurement of A^g^ at 24 and 72 hours. 
Activity against DCB was then determined in the bathing solution (unbound) and on the 
washed gels (bound) after the 72 hour incubation. Tables 6 and 7 show the results of these 
studies. 



Table 6 rRDhI Enzyme Uptake into PEI-imoreanated GIA treated Porous 

Supports as Monitored by A28Q 



Support 



%Loaded @ 24 hr 



% Loaded @ 72 hr 



Alumina - Davison Low SA 


83% 




62% 


Alumina - Norton SA 6176 


86% 




84% 


Alumina - Calcicat Type C 


84% 




67% 


Silica - Calcicat S-88-473 TypeA 


69% 




72% 


Silica - Shell 5980-F 


81% 




93% 


Silica - Davison 952-08-5X 


91% 




92% 


Carbon - Borecker Subunit 


77% 




91% 


Carbon - AmCy 5701 -Sn 


90% 




95% 


Celite - Manville R648 


82% 




95% 


PEI-Silica - Siama 


85% 




93% 
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Table 7 Recovery of Enzyme Activity on PEI-lmpregnated GIA treated Supports 



Support 


% Bound 


% Unbound 


%Lost* 


Alumina - Davison Low SA 


7% 


38% 


55% 


Alumina - Norton SA 6176 


3% 


17% 


80% 


Alumina - Calcicat Type C 


7% 


34% 


59% 


Silica - Calcicat S-88-473 Type A 


12% 


28% 


60% 


Silica - Shell 5980-^ 


12% 


8% 


80% 


Silica - Davison 952-08-5X 


17% 


11% 


72% 


Carbon - Borecker Subunit 


5% 


9% 


86% 


Carbon - AmCy 5701 -Sn 


7% 


5% 


88% 


Celite • Manville R648 


21% 


5% 


74% 


PEI-Silica - Sioma 


6% 


7% 


87% 



15 *% Lost = 100% - (% Bound + % Unbound) 

Each support exhibits a different uptake profile ranging from 62% to 95% uptake after 
72 hours. For most systems, 72 hours is adequate to achieve maximum loading of protein 
achievable at 4 °C. However, the three alumina systems actually showed greater uptake at 

20 24 hours than at 72 hours. Recovery of bound enzyme activities at 72 hours ranged from 
3% to 21% as compared to an untreated soluble enzyme control. Considerable activity was 
unaccounted for or "lost" in all systems examined, ranging from 55% to 87%. Also, the 
previously run supports (Sigma PEI-Silica and Manville Celite R648) showed poorer bound 
recoveries. This could be due to either the lower enzyme loading ratio or the longer 

25 incubation times used in this experiment. Given the efficiency of binding and the recovery of 
bound enzyme activity, Celite, silica, carbon, and alumina all function as effective 
immobilization support materials in the present invention, although Celite and silicas 
outperform alumina and carbon. In terms of stability, however, alumina supports appear to 
perform better (see below). 

30 The bound samples were also submitted to a long term stability study. Following 

assay using DCB as a substrate, supports were rinsed and immersed in TCP-saturated 
buffer. At the given time-point, TCP buffer was removed, supports were rinsed again and 
assayed with DCB. 
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Table 8 Long term stability study of PEI cross-linked supports at room temperature. 



Ci jnnnrt 


% Activitv Maintained 




at 41 hr 


at 136 hr 


Alumina 1 


38 


57 


Alumina 2 


66 


67 


Alumina 3 


58 


75 


Silica 1 


76 


81 


Silica 2 


79 


46 


Silica 3 


60 


30 


Carbon 1 


75 


0 


Carbon 2 


37 


48 


Celite 


57 


12 


PEI-Silica 


43 


60 



Thus the two supports which exhibited intermediate levels of recovery in the 
immobilization reaction, silicas and aluminas, proved to have the best stabilities over time. 
All of these supports were also screened for their ability to bind the enzyme directly without 
10 PEI or GIA modification. However, binding was very poor and irreproducible, and allowed 
easy removal of enzyme from the supports with washing. 

Polyethvlenelmine-impreanated Inorganic Supports with Enzym e Bound bv Ion Exchange 

The molecular weight of PEI is also known to have an impact on the overall yield and 
stability of immobilized enzymes. In addition, PEI is capable of functioning either as an ion 

15 exchange ligand on various supports or as a glutaraldehyde cross-link acceptor. For these 
reasons, PEIs of two different molecular weights were impregnated onto various porous 
inorganic supports following the method described in the previous section. In these 
experiments, however, enzyme (semi-purified preparations containing 1-2 U/mL) was bound 
by ion-exchange but the GIA crosslinking step was omitted. Samples were submitted for a 

20 stability screen to determine if the size of the PEI was an important factor. 
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Table 9 Stability Screen of PEi-lmpregnated Porous Supports. No Crosslinking 



Support Supplier MW RE I % Activity Maintained 

at24hr at12Qhr 



1 


Alumina 


Norton SA6176 


50,000 


70 


62 


2 


Alumina 


Calcicat Type C 




61 


42 


3 


Silica 


Calcicat S-88-473 Type A 


H 


101 


64 


4 


Silica 


Shell 5980-F 


II 


80 


55 


5 


Carbon 


Borecker Subunit 


It 


41 


32 


6 


Carbon 


AmCy 5701 -Sn 


II 


39 


17 


7 


Celite 


Manville R648 


n 


83 


50 


8 


Alumina 


Norton SA6176 


2000 


82 


55 


9 


Alumina 


Calcicat Type C 




91 


61 


10 


Silica 


Calcicat S-88-473 Type A 




94 


57 


11 


Silica 


Shell 5980-F 




76 


55 


12 


Carbon 


Borecker Subunit 




44 


50 


13 


Carbon 


AmCy 5701 -Sn 




42 


21 


14 


Celite 


Manyille R648 




58 


2 



5 

With the exception of Celite, the different molecular weights of PEI did not appear to 
haye a major impact on either immobilization efficiency or stability of the enzyme. 

Example 4 

10 Construction of pRSET-RDhl.Nde 

The pRSET-RDhl.Nde expression vector was generated by digesting plasmid pRSET 
RDhI clone 16-4 with the restriction enzymes, Nde I and Hind III, and then incorporating into 
the construct a RDhI gene fragment which contained a Nde I site at its 5' end and a Hind 111 
site at its 3' end. The new construct was then transformed into E. co// JM109 competent 

15 cells (from Invitrogen of Carlsbad, CA, USA) and ampicillin resistant colonies were picked. 
Plasmids containing the RDhI gene were identified by analytical restriction enzyme digestion 
and referred to as the pRSET-RDhl.Nde construct. 
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Production of Recombinant RDhI Protein 

Both of the new constructs - pRSET-RDhl.Nde and pTrcHis-RDhI - were 
transformed into E. co// B834(DE3) competent cells (from Novagen, Inc. of Madison, Wl, 
USA). The production of active dehalogenase enzymes was confirmed by a dehalogenation 
5 activity assay and enzyme production levels were investigated with PAGE. Dehalogenation 
activity was measured by using a colorimetric >.iloride release assay at 460 nm to assess 
enzymaLj dechlorination activity toward 1,4-dichlorobutane (DCS). 

We obsen/ed the enhanced production of recombinant RDhI enzyme in this host-E. 
CO// B834{DE3) competent cell. The following table shows the relationship between 
10 dehalogenating activity and the percent of rRDhI enzyme in the total soluble protein among 
different expression systems and host cells. 



Expression 
System 


Competent 
Host Cell 


rPDhl as % of 
Soluble Protein 


DCB* Activity per mL 
of Culture (x 10 ') 


pEXPROK 


E. coll AG 1' 


-3 


-0.3 


pRSET 


E. coll JM 109 


-10 


-0.8 


pTrcHis 


E. coliTOP 10F'* 


-15 


-2.4 


pTrxFus 


E. coliGI 174* 


-30 


-4.8 


pTrcHis 


E. coli B834(DE3) 


-42 


-4.5 - 12.5 


pRSET 


E. coli B834(DE3) 


-48 


-14.8 



* DCB unit is a measure of dechlorination activity toward 1,4-dichlorobutane (DCB). 
15 * E. coli AG 1 chemically competent cells were purchased from Stratagene (La Jolla, CA, USA); 

E coli TOP 10F' chemically competent cells were purchased from invitrogen (Carlsbad, CA, 
USA); E. co// Gl 174 cells were purchased from Invitrogen (Carlsbad, CA) and were made 
electro-competent according to the supplier's instructions. 

20 

Example 5 

Modified Rhodococcus Dehalooenase 

Since the Rhodococcus dehalogenase being produced by the TrcHis RDhI construct 
had been modified with additional amino acids at both the amino and carboxy termini, 
25 plasmid constructs were generated to test the effects each of these modifications might have 
on the activity of the enzyme. The amino terminal poly-histidine tail was eliminated by 
enzymatic digestion of the pTrcHis RDhI 18-3 plasmid with Nco\ and Age\ and the ligation of 
a 17 bp oligo into the resulting gap. 
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The DNA sequences of the oligo pairs are as follows: 

RDhI Delta His-6-F 

5'-CATGGGTGAAATAGGTA-3' 

5 

RDhI Delta His-6-R 

5*-CCGGTACCTATTTCACC-3' 

Using standard molecular biology protocols, the His-6-F and His-6-R oligonucleotides 
10 were annealed, ligated into the digested 18-3 construct, and transformed into competent E. 
coll TOP10 F' cells. Transformed colonies were selected by growth on LB/Amp agar plates. 
The resulting amino terminal sequence was: 

-12 -11 3 
1 5 ATG GGT GAA ATA GGT 

Met Gly lie 

(shown with the amino acid numbering of the original, unmodified sequence). 

Digestion and re-iigation resulted in a construct in which the Ala-293 Ser-294 
20 sequence (Figure 2) became an Ala-293 Arg-294 sequence. Following the Arg-encoding 
codon is a stop codon which corresponds to the TGA nucleotide tri-mer at bases 927-979 in 
the original sequence. 

The carboxy terminus EXFLAG was eliminated by digesting pTrcHls RDhI 18-3 with 
Ai^rll and Nhe\ and re-ligating the plasmid. 

25 Individual clones were screened by enzymatic digestion and gel electrophoresis. 

Candidate clones were grown at 37° C in 5 mL cultures, induced with IPTG, and lysed by 
sonlcation. The lysates were analyzed by PAGE, Western blot, and chloride detection 
assay. Those clones lacking the amino terminal poty-histidines or the carboxy terminal 
EXFLAG demonstrated catalytic activity equal to the original construct. 

30 

Example 6 

Construction of oTrcHis RDhl-S-Tag and pRSET RDhl-S-Taa 

Material and Methods: 

CTERM S-Tag F (fonward) and CTERM S-Tag R (reverse) are two primers that were 
35 designed to change the FLAG polypeptide - an 11 amino acid sequence - to the S-Tag 
polypeptide, a 15 amino acid sequence. The sequences of these oligonucleotides are as 
follows (each strand of the S-Tag fragment is underlined): 
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10 



15 



CTERM S-Tag F 



CTERM S-Tag R 



— Avr II— 

5'-CTA GGT GAC AAA G AA ACC OCT OCT OCT AAA 
-Nsp V— 

TTC GAA CGC CAG CAC ATG GAC AGC AAA TAA 
GTT TAA ACA TCA TTCCAATTGC 
—Not I— 

5'-GGCCGCAATTGGAATGATGTTTA AAC TTA TTTGCT 

_.-Nsp V— 

GTC CAT GTG CTG GCG TTC GAA TTT AGC AGC AGC 
GGT TTC TTT GTCAC 



Construction of pTrcHls RDhl-S-Taa 

To generate plasmid pTrcHis RDhl-S-Tag, the plasmid pTrcHis RDhI clone 18-3 was 

20 digested with the restriction enzymes AvrW and Not I and ligated with the S-Tag fragment 
(the S-Tag fragment was prepared by annealing primer CTERM S-Tag F and primer 
CTERM S-Tag R together at room temperature). The new construct, pTrcHis RDhl-S-Tag, 
was incorporated into £ coli AG 1 competent cells (from Stratagene of La Jolla, CA, USA) 
and ampiciliin resistant colonies were picked. Plasmids containing the S-Tag fragment were 

25 identified by analytical restriction enzyme digestion. 

Construction of pRSET RDhl-S-Taa 

The same procedure that was used to construct pTrcHis RDhl-S-Tag was also used 
to construct pRSET RDhl-S-Tag, but instead starting with the plasmid pRSET RDhI clone 
1 6-4 - digested with the restriction enzymes, Avr II and Not I - and ligating that construct to 
30 the S-Tag fragment described above. 

Semi-purified rRDhI (the EXFLAG-tagged protein derived from TrcHis RDhI Clone 
18-3) was compared kinetically with semi-purified RDhl-S-Tag protein produced in this 
example. Chloride-releasing activity was examined at TCP concentrations ranging from 
OmM to 5mM. As shown in Figure 19, the S-Tag-modified protein exhibited a consistent 
35 increase of about 15% in Vmax over the EXFLAG-modified protein. The S-Tag protein also 
exhibits a -25% lower Km for TCP than does the EXFLAG protein. These results confirm 
that changes at the C-terminal end of the TDhI enzyme can be used to modulate and 
improve activity of the enzyme. 
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Example 7 

Reactor Design and Performance 
Reactor Set-Up: 

A bench-scale reactor was assembled using all 316 stainless steel with inch ID, in 
5 a shell and tube design. Reactor inlet and outlet tuoing were also stainless steel. A Lauda 
circulating water bath (with a thermostat) was u .ad to maintain the reactor at 30''C. The 
reactor w-s packed with immobilized rRDhI enzyme running in an up-flow direction. The 
immobilized rRDhI enzyme was first prepared by loading a partially purified enzyme 
preparation (approximately 70% purity by SDS-PAGE) onto PEI-impregnated alumina (ISP 

10 4000 grade from UOP) which had been pretreated with 25% (w/v) glutaraldehyde for 2 
hours; this was followed by extensive washing with distilled water. Sufficient protein was 
introduced to the alumina to provide 300 mg protein (by Lowry method) per gm of support. 
Binding was allowed to occur overnight at room temperature. Bound enzyme activity was 
estimated by measuring unbound enzyme activity in the bathing solution or by final 

1 5 absorbance at 280 nm . 

The immobilized rRDhI enzyme was transferred to the reactor using 2 mm glass 
beads as spacers at the inlet and outlet Flow was initiated using an aqueous feed of a pre- 
warmed 10 mM sodium phosphate/10 |iM EDTA buffer (pH 7.0). After several hours of wash 
to remove any unbound enzyme, the aqueous feed was saturated with 1,2,3- 

20 trichloropropane (TCP) and delivered as a continuously stirred solution at a flow rate of 0.15 
ml_/min. The reactor was allowed to equilibrate for - 2 residence times before sampling the 
inlet and outlet streams for analysis of reactant TCP and product 2,3-dichloro-1 -propanoi 
(DCH) concentrations by GC. In order to prepare the samples for GC, each sample was first 
saturated with sodium sulfate and then extracted with chloroform (2 volumes) containing 

25 10mM each of two internal standards (1,1,1,2-tetrachloroethane and 3-chloro-1 -propanoi). 
TCP and DCH levels were then estimated from the GC data using the internal standard 
method and the productivity (the percent yield per volume per time) was calculated 
therefrom. This initial productivity was used as a measure of initial enzyme activity. 

Productivity of Bench Scale rRDhI Bioreactor 
30 The bioreactor was run continuously for a period of three months, with periodic 

sampling of the inlet and outlet streams according to the above-described method. 
Volumetric productivity (product weight per fluid volume per minute) of the enzyme was 
determined at each time point and the percent conversion felt from about 60% to about 40% 
over this time period. Measurements are presented in Figure 16. Based on this data, the 



wo 98/36080 



PCT/US98/02776 



half life of the immobilized enzyme was estimated to be about 3500 hours. This places the 
immobilized dehalogenase among the most stable protein catalysts yet reported. 

Example 8 

5 Directed Evolution of Dehalogenases by EPPCR 

Error-Prone PGR Mutagenesis was performed upon Agel- and A//iel-digests of 
plasmid pTrc/His RDh1 18-3 which had been purified by agarose gel electrophoresis and 
extracted from the gel. The EPPCR products were ligated into expression vectors having 
Agel and Nhe\ cutting sites. The resulting plasmids were transformed into competent AGI 

10 cells which were grown into colonies on ampicillin-supplemented agar. The resulting cell 
clone "pTric/His RDhl" EPPCR library was tested using the procedure for measuring RDhI 
enzyme activity by detection of pH change, as follows: wells B1 to H1 2 of a 96-well 
microplate, each containing 200|aL SOB/Amp broth, was inoculated with a single colony from 
the pTrc/His RDhl EPPCR library; wells A1-A6 contain only media as a negative control, and 

15 wells A7-A12 were inoculated with wild type pTrc/His RDhl 18-3 colonies as a positive 
control. Representative results are presented in Figures 17 and 18. 

The results demonstrate that most of the clones produced by EPPCR mutagenesis 
exhibit activities equal to or less than the activity range for the wild-type RDhl produced by 
TrcHis RDhl Clone 18-3. However, in each case in which 84 random clones from an EPPCR 
20 library were analyzed for dehalogenase activity, a few on each 96-well plate exhibited activity 
significantly higher than that of the wild-type enzymes. 

On February 3, 1998, the three plasmids, pTrcHis RDhl clone 18-3, pRSET RDhl 
clone 16-4, and pTrxFus RDhl clone 4, were deposited with the American Type Culture 
Collection (ATCC) in accordance with the Budapest Treaty and were respectively given the 
25 following designations: ATCC 209609, ATCC 209610, and ATCC 20961 1. On February 3, 
1998, the cell culture E. co// TrxFus RDhl done 4, was deposited with the American Type 
Culture Collection (ATCC) in accordance with the Budapest Treaty and was given the 
following designation: ATCC 202087. 

On January 30, 1998, the two cell cultures, E. co// TrcHis RDhl clone 18-3 and £ coli 
30 RSET RDhl clone 16-4 were deposited with the American Type Culture Collection (ATCC) in 
accordance with the Budapest Treaty and were respectively given the following 
designations: ATCC 202086 and ATCC 202085. 
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Other embodiments of the invention will be apparent to those skilled in the art from a 
consideration of this specification or practice of the invention disclosed herein. It is intended 
that the specification and examples be considered as exemplary only, with the true scope 
and spirit of the invention being indicated by the following claims. 
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GENERATION INFORMATION: 
APPLICANT: 

NAME: The Dow Chemical Company 
STREET: 1790 Bldg . Washington Street 
CITY: Midland 
STATE: MI 
COUNTRY: U.S.A. 
POSTAL CODE: 4 8 674 
TELEPHONE: 517-636-1687 
TELEFAX: 517-638-978 6 



TITLE OF INVENTION: Recombinant Haloaliphatic Dehalogenases 



NUMBER OF SEQUENCES: 26 



COMPUTER READABLE FROM: 

MEDIUM TYPE: 3-1/2" Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: MS-DOS 
SOFTWARE : PatentIn 



INFORMATION FOR SEQ ID N0:1: 



SEQUENCE CHARACTERISTICS 
LENGTH: 305 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



ORIGINAL SOURCE: 

ORGANISM: Rhodococcus rhodocrous 
INDIVIDUAL ISOLATE: TDTM003 

FEATURE : 

NAME/KEY: RDhl Enzyme 
LOCATION: 1..2 92 

FEATURE: 

NAME/KEY: Carboxy-terminal EXFLAG tail 
LOCATION: 295. .305 

FEATURE: 

NAME/KEY: Amino-terminal poly-His tail 
LOCATION: -10. .-1 

SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Gly Gly Ser His His His His His His Gly Met Ser Glu He Gly 
-12 -10 -5 -1 1 



Thr Gly Phe Pro Phe Asp Pro His Tyr Val Glu Val Leu Gly Glu Arg 
5 10 

Met His Tyr Val Asp Val Gly Pro Arg Asp Gly Thr Pro Val Leu Phe 



50 



wo 98/36080 

Leu His Giy Asn Pro Thr Ser Ser 

His Val Ala Pro Ser His Arg Trp 

Gly Lys Ser Asp Lys Pro Asp Leu 

Arg Tyr Leu Asp Ala Phe lie Glu 

Leu Val He His Asp Trp Gly Ser 

Arg Asn Pro Glu Arg Val Lys Gly 

Pro He Pro Thr Trp Asp Glu Trp 

Gin Ala Phe Arg Thr Ala Asp Val 

Asn Ala Phe He Glu Gly Val Leu 

Thr Glu Val Glu Met Asp His Tyr 

Asp Arg Glu Pro Leu Trp Arg Phe 

Glu Pro Ala Asn He Val Ala Leu 

His Gin Ser Pro Val Pro Lys Leu 

Leu He Pro Pro Ala Glu Ala Ala 

Cys Lys Thr Val Asp He Gly Pro 

Asn Pro Asp Leu He Gly Ser Glu 

Ala Ser Lys Leu Gly Asp Tyr Lys 
295 300 
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Tyr Leu Trp Arg Asn He He Pro 

He Ala Pro Asp Leu He Gly Met 

Asp Tyr Phe Phe Asp Asp His Val 

Ala Leu Gly Leu Glu Glu Val Val 

Ala Leu Gly Phe His Trp Ala Lys 

lie Ala Cys Met Glu Phe lie Arg 

Pro Glu Phe Ala Arg Glu Thr Phe 

Gly Arg Glu Leu He He Asp Gin 

Pro Lys Cys Val Val Arg Arg Leu 

Arg Glu Pro Phe Leu Lys Pro Val 

Pro Asn Glu He Pro He Ala Gly 

Val Glu Ala Tyr Met Asn Trp Leu 

Leu Phe Trp Gly Thr Pro Gly Val 

Arg Leu Ala Glu Ser Leu Pro Asn 

Gly Leu His Tyr Leu Gin Glu Asp 

He Ala Arg Trp Leu Pro Gly Leu 

290 

Asp Asp Asp Asp Lys 
305 
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SEQ ID N0:2 
RDhl Fig. 2 DNA 



cc 


ATG 


GGG 


GGT 


GGT 


ACC 


GGT 


TTT 


CGT 


ATG 


CAC 


TAC 


TTC 


CTG 


CAC 


GGT 


CCG 


CAT 


GTA 


GCA 


ATG 


GGA 


AAA 


TCG 


GTC 


CGC 


TAC 


CTC 


GTC 


CTG 


GTC 


ATC 


AAG 


CGC 


AAT 


CCG 


CGG 


CCT 


ATC 


CCG 


TTC 


CAG 


GCC 


TTC 


CAG 


AAC 


GCT 


TTC 


CTT 


ACG 


GAG 


GTC 


GTT 


GAC 


CGA 


GAG 


GGT 


GAG 


CCC 


GCG 


CTG 


CAC 


CAG 


TCA 


GTA 


CTG 


ATC 


CCC 


AAC 


TGC 


AAG 


ACA 


GAC 


AAC 


CCG 


GAC 


CTG 


GCT 


AGC 


GGC 


GCGGCCGC AAGCTT 



SEQ ID NO: 3 
TCCH aa 



Met 


Ser 


Leu 


Gly 


Lys 


Gly 


Arg 


Arg 


Leu 


Phe 


Gin 


His 


Met 


Pro 


His 


Cys 


Gly 


Met 


Gly 


Asp 


Ala 


Tyr 


Ala 


Glu 


Asp 


Leu 


Gly 


Asp 


Leu 


Gly 


Phe 


Asp 


Ala 


Tyr 


Met 


Glu 


Glu- 


Gin 


Asp 


Arg 


Glu 


Leu 


Val 


Leu 


Leu 


He 


Leu 


Arg 


Pro 


Phe 


Leu 


Ala 


Gin 


He 


Pro 


He 


Asp 


Tyr 


Ala 


Gly 


Asn 


Ala 


Glu 


Pro 


Arg 


Thr 


Trp 


Pro 


He 


Gin 


Glu 


Asp 


Val 


Arg 


Arg 


Leu 


SEQ 


ID NO: 4 




Rlucif ; 


aa . 




Met 


Thr 


Ser 


Lys 


Gly 


Pro 


Gin 


Trp 


Phe 


He 


Asn 


Tyr 


Phe 


Leu 


His 


Gly 



TCT 


CAT 


CAT 


CAT 


CCC 


TTC 


GAC 


CCT 


GTC 


GAT 


GTT 


GGA 


AAC 


CCG 


ACC 


TCG 


CCG 


ACT 


CAT 


CGG 


GAC 


AAA 


CCA 


GAC 


GAT 


GCC 


TTC 


ATC 


CAC 


GAC 


TGG 


GGC 


GAA 


CGG 


GTC 


AAA 


ACG 


TGG 


GAC 


GAA 


CGG 


ACC 


GCC 


GAC 


ATC 


GAG 


GGT 


GTG 


GAG 


ATG 


GAC 


CAC 


CCA 


CTG 


TGG 


CGA 


AAC 


ATC 


GTC 


GCG 


CCT 


GTC 


CCG 


AAG 


CCG 


GCC 


GAA 


GCC 


GTG 


GAC 


ATC 


GGC 


CTT 


ATC 


GGC 


ACT 


CTA 


GGT 


GAC 


TAC 



Ala 


Lys 


Pro 


Phe 


Met 


Ala 


Tyr 


He 


Gly 


Asn 


Pro 


Thr 


Ala 


Gly 


Leu 


Gly 


Ser 


Asp 


Lys 


Leu 


His 


Arg 


Asp 


Tyr 


Arg 


Val 


Val 


Leu 


Trp 


Ala 


Arg 


Arg 


Ala 


He 


Ala 


Met 


Asp 


Leu 


Phe 


Gin 


Gin 


Asp 


Asn 


Val 


Pro 


Leu 


Ser 


Glu 


Ala 


Glu 


Ala 


Arg 


Ala 


Gly 


Thr 


Pro 


Trp 


Leu 


Ser 


Glu 


Gly 


Ala 


Leu 


Thr 


Asn 


Gin 


Thr 


Glu 


Ser 


Pro 


Asp 


Glu 


Arg 


Pro 


Ala 




Val 


Tyr 


Asp 


Pro 


Trp 


Ala 


Arg 


Cys 


Tyr 


Asp 


Ser 


Glu 


Asn 


Ala 


Ala 


Ser 



CAT CAT CAT GGT 
CAT TAT GTG GAA 
CCG CGG GAT GGC 
TCC TAC CTG TGG 
TGC ATT GCT CCA 
CTC GAT TAT TTC 
GAA GCC TTG GGT 
TCA GCT CTC GGA 
GGT ATT GCA TGT 
TGG CCG GAA TTC 
GTC GGC CGA GAG 
CTC CCG AAA TGC 
TAT CGC GAG CCC 
TTC CCC AAC GAG 
CTC GTC GAG GCA 
TTG TTG TTC TGG 
GCG AGA CTT GCC 
CCG GGA TTG CAC 
GAG ATC GCG CGC 
AAG GAC GAT GAT 



Gly Glu Lys Lys 
Asp Glu Gly Thr 
Ser Ser Tyr Leu 
Arg Leu He Ala 
Asp Pro Ser Gly 
Leu Asp Ala Leu 
Val Val His Asp 
His Arg Glu Arg 
Pro He Glu Trp 
Ala Phe Arg Ser 
Phe Val Glu Gin 
Ala Glu Met Ala 
Arg Pro Thr Leu 
Ala Asp Val Val 
Ser Pro He Pro 
Thr Gly Arg Met 
He Thr Val Ala 
He Gly Ala Ala 



Glu Gin Arg Lys 
Lys Gin Met Asn 
Lys His Ala Glu 
Ser Tyr Leu Trp 



ATG TCT GAA ATA 4 7 
GTC CTG GGC GAG 
ACG CCT GTG CTG 
CGC AAC ATC ATC 
GAC CTG ATC GGG 
TTC GAC GAC CAC 
TTG GAA GAG GTC 
TTC CAC TGG GCC 
ATG GAA TTC ATC 
GCC CGT GAG ACC 
TTG ATC ATC GAT 
GTC GTC CGT CCG 
TTC CTC AAG CCT 
ATC CCC ATC GCC 
TAC ATG AAC TGG 
GGC ACA CCC GGC 
GAA AGC CTC CCC 
TAC CTC CAG GAA 
TGG CTC CCC GGA 
GAC AAA TAA TGA 



Phe He Glu He 
Gly Asp Pro He 
Trp Arg Asn He 
Cys Asp Leu He 
Pro Glu Arg Tyr 
Trp Glu Ala Leu 
Trp Gly Ser Ala 
Val Gin Gly He 
Ala Asp Phe Pro 
Gin Ala Gly Glu 
Val Leu Pro Gly 
Ala Tyr Arg Glu 
Ser Trp Pro Arg 
Ala He Ala Arg 
Lys Leu Phe He 
Arg Asp Phe Cys 
Gly Ala His Phe 
He Ala Ala Phe 



Arg Met He Thr 
Val Leu Asp Ser 
Asn Ala Val He 
Arg His Val Val 
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Pro His lie Glu Pro Vai Ala Arg Cys He He Pro Asp Leu He Gly 
Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp 
His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys 
Lys He He Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His 
Tyr Ser Tyr Glu His Gin Asp Lys He Lys Ala He Vai His Ala Glu 
Ser Val Val Asp Val He Glu Ser Trp Aso Glu Trp Pro Asp He Glu 
Glu Asp He Ala Leu He Lys Ser Glu Glu Gly Glu Lys Met Val Leu 
Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys He Met Arg 
Lys Leu Glu Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu 
Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu He Pro 
Leu Val Lys Gly Gly Lys Pro Asp Val Val Gin He Val Arg Asn Tyr 
Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe He Glu 
Ser Asp Pro Gly Phe Phe Ser Asn Ala He Val Glu Gly Ala Lys Lys 
Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gin 
Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr He Lys Ser Phe Val Glu 
Arg Val Leu Lys Asn Glu Gin 

SEQ ID N0:5 
XDhl aa. 

Met He Asn Ala He Arg Thr Pro Asp Gin Arg Phe Ser Asn Leu Asp 

Gin Tyr Pro Phe Ser Pro Asn Tyr Leu Asp Asp Leu Pro Gly Tyr Pro 

Gly Leu Arg Ala His Tyr Leu Asp Glu Gly Asn Ser Asp Ala Glu Asp 

Val Phe Leu Cys Leu His Gly Glu Pro Thr Trp Ser Tyr Leu Tyr Arg 

Lys Met He Pro Val Phe Ala Glu Ser Gly Ala Arg Val He Ala Pro 

Asp Phe Phe Gly Phe Gly Lys Ser Asp Lys Pro Val Asp Glu Glu Asp 

Tyr Thr Phe Glu Phe His Arg Asn Phe Leu Leu Ala Leu He Glu Arg 

Leu Asp Leu Arg Asn He Thr Leu Val Val Gin Asp Trp Gly Gly Phe 

Leu Gly Leu Thr Leu Pro Met Ala Asp Pro Ser Arg Phe Lys Arg Leu 

He He Met Asn Ala Cys Leu Met Thr Asp Pro Val Thr Gin Pro Ala 

Phe Ser Ala Phe Val Thr Gin Pro Ala Asp Gly Phe Thr Ala Trp Lys 

Tyr Asp Leu Val Thr Pro Ser Asp Leu Arg Leu Asp Gin Phe Met Lys 

Arg Trp Ala Pro Thr Leu Thr Glu Ala Glu Ala Ser Ala Tyr Ala Ala 

Pro Phe Pro Asp Thr Ser Tyr Gin Ala Gly Val Arg Lys Phe Pro Lys 

Met Val Ala Gin Arg Asp Gin Ala Cys He Asp He Ser Thr Glu Ala 

He Ser Phe Trp Gin Asn Asp Trp Asn Gly Gin Thr Phe Met Ala He 

Gly Met Lys Asp Lys Leu Leu Gly Pro Asp Val Met Tyr Pro Met Lys 

Ala Leu He Asn Gly Cys Pro Glu Pro Leu Glu He Ala Asp Ala Gly 

His Phe Val Gin Glu Phe Gly Glu Gin Val Ala Arg Glu Ala Leu Lys 

His Phe Ala Glu Thr Glu 
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SEQ ID NO: 6 
RDhl 5.4 

GGTTCCATGG GNTTYCCNTT YGAYCCNCAY TA 

SEQ ID NO: 7 
RDhl 3.12 

CAGAGCTAGC GAGTCCGGGG AGCCAGCG 

SEQ ID NO: 8 
RDhl 5.7 

CGTACATATG GCCATGGGGG GTTCTCATCA TCATCATCAT CATGGTATGT CTGAAATAGG 
TACCGGTTTT CCCTTCGACC CTCATTA 

SEQ ID NO: 9 
RDhl 3.13 

GATGACAAAT AATGAGCGGC CGCAAGCTTG TAG 

SEQ ID NO: 10 
Trx2++ 

CCGGGGATCC CATGGCTTCT GAAATACGTA CCGGTTTTCC CTTCGACCCT CATTA 

SEQ ID NO: 11 
Trx- 

TCGACTGCAG GCGGCCGCTC ATTATTTGTC ATC 

SEQ ID NO: 12 
Dhl Seq 7 

CCTGTCCCGA AGTTGTTG 

SEQ ID NO: 13 
Dhl Seq 8 

CGGGCCGATC TCCACTG 

SEQ ID NO: 14 
Dhl Seq 11 

TGCTCCAGAC CTGATCG 

SEQ ID NO: 15 
Dhl Seq 12 

TCTGATCGAT GATCAAC 

SEQ ID NO: 16 
Dhl Seq 13 

TCCCGACGTG GACGAATG 
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SEQ ID NO: 17 
Dhl Seq 14 

GAGCGCGACG ATGTTCGC 

SEQ ID NO: 18 
Dhl Seq 15 

CACCCGGCGr ACTGATCC 

SEQ ID NO: 19 
Dhl Seq 18 

GAGACCGGTC AGCATTCC 

SEQ ID NO:20 
PROK-Seql 

GAGCGGATAA CAATTTCA 

SEQ ID N0:21 
PR0K-Seq2 

TCTCATCCGC CAAAACAG 

SEQ ID NO:22 
EXFLAG linker 

GAATTCAGCC ATGGCATAAG CTTTCTAGAC TCGAGGGAGC TAGCGGCCTA GGTGACTACAA 
GGACGATGAT GACAAATAAT GAGCGGCCGC TAGCTT 

SEQ ID NO:23 

RDhl Delta His-6-F 

CATGGGTGAA ATAGGTA 

SEQ ID NO:24 

RDhl Delta His-6-R 

CCGGTACCTA TTTCACC 

SEQ ID NO:25 
CTERM S-Tag F 

CTAGGTGACAAAGAAACCGCTGCTGCTAAATTCGAACGCCAGCACATGGACAGCAAATAAGTTTAAACATCA 
TTCCAATTGC 

SEQ ID NO:26 
CTERM S-Tag R 

GGCCGCAATTGGAATGATGTTTAAACTTATTTGCTGTCCATGTGCTGGCGTTCGAATTTAGCAGCAGCGGTT 
TCTTTGTCAC 
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What is Claimed is: 

I . An enzyme capable of converting a halogenated alipliatic hydrocarbon to a 
halohydrin, said enzyme contains a polypeptide substantially homologous with the amino 
acid sequence of residues 1-292 of Figure 2. 

5 2. The enzyme of Claim 1 wherein said enzyme contains an amino acid 

sequence which is at least about 90% to 100% homologous with the amino acid sequence of 
residues 1-292 of Figure 2. 

3 xhe enzyme of Claim 2 wherein said enzyme contains an amino acid 
sequence which is at least about 95% homologous with the amino acid sequence of residues 
10 1-292 of Figure 2. 

4. A DNA sequence from which can be expressed a polypeptide substantially 
homologous with the amino acid sequence of residues 1-292 of Figure 2, said polypeptide 
being capable of converting a halogenated aliphatic hydrocarbon to a halohydrin. 

5. The DNA sequence of Claim 4 wherein said polypeptide is at least about 90% 
15 to 100%» homologous with the amino acid sequence of residues 1 -292 of Figure 2. 

6. The DNA sequence of Claim 5 wherein said polypeptide is at least about 95% 
homologous with the amino acid sequence of residues 1-292 of Figure 2. 

7. A DNA sequence containing a polynucleotide substantially homologous with 
the nucleotide sequence of bases 37-912 of Figure 2, said polynucleotide being capable of 

20 expressing a polypeptide which is able to convert a halogenated aliphatic hydrocarbon to a 
halohydrin. 

8. The DNA sequence of Claim 7 wherein said polynucleotide is at least 90% to 
100% homologous with the nucleotide sequence of bases 37-912 of Figure 2. 

9. The DNA sequence of Claim 8 wherein said polynucleotide is at least 95% 
25 homologous with the nucleotide sequence of bases 37-912 of Figure 2. 

10. A microorganism containing a recombinant plasmid wherein the plasmid is 
capable of directing the synthesis of an enzyme containing a polypeptide substantially 
homologous with the amino acid sequence of residues 1-292 of Figure 2. 

I I . The microorganism of Claim 10 wherein the microorganism is of the genus 
30 Escherichia, Pichia, Bacillus, Saccharomyces, Pseudomonas, Rhodococcus, Actinomyces, 

or Aspergillus, 
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12. The microorganism of Claim 1 1 wherein the microorganism is of the genus 
Escherichia. 

13. An expression construct containing a DNA sequence which encodes a 
polypeptide substantially homologous with the amino acid sequence of residues 1-292 of 

5 Figure 2. 

14. An immobilized enzyme having a haloalkane dehalogenase enzyme which 
has haloaliphatic dehalogenase activity and is attached to a solid support. 

15. The enzyme of Claim 14 wherein the enzyme is capable of hydrolytically 
removing at least one halogen substituent from a molecule or group selected from the group 

10 consisting of haiogenated aliphatic hydrocarbon, halogenated aliphatic alcohol, and 
halogenated aliphatic polyol molecules and groups. 

16. The enzyme of Claim 15 wherein said molecule or group has at least one 
halogen atom and 2 to 10 carbon atoms, each of said carbon atoms being independently 
substituted with one or fewer of said halogen atoms, provided that when said molecule or 

15 group is an alcohol or polyol, no carbon atom thereof having a hydroxy substituent also has 
a halogen substituent. 

17. The enzyme of Claim 16 wherein said molecule or group contains at least 2 
halogen atoms. 

18. The enzyme of Claim 17 wherein said molecule or group is a 1 ,2-dihalo 
20 molecule or group. 

19. The enzyme of Claim 17 wherein said molecule or group is selected from the 
group consisting of the 1,2-dihaloethane, 1,2-dlhalopropane, 1 ,2-dihalobutane, and 1,2,3- 
trihalopropane. 

20. The enzyme of Claim 19 wherein said molecule or group is respectively 
25 selected from the group consisting of 1 ,2-dichloroethane, 1 ,2-dichloropropane, 1 ,2- 

dichlorobutane, 1,2-dibromo-3-chloropropane, and 1,2,3-trichloropropane, 

21 . The enzyme of Claim 20 wherein said molecule or group is converted to at 
least one product molecule or product group which is selected from 2-chioro-ethanol, 1- 
chloro-2-propanol, 2-chloro-1-propanol, 1-chloro-2-butanoK 2-chloro-1-butanol, 1-bromo-3- 

30 chloro-2-propanol, 2-bromo-3-chloro-1 -propanol, 2,3-dibromo-1 -propanol, 1 ,2-dichloro-3- 
propanol, and 1,3-dichloro-2-propanot. 
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22. The enzyme of Claim 14 wherein said haloalkane dehalogenase is obtained 
from a Rhodococcus. 

23. A process for preparing an enzyme containing a polypeptide substantially 
homologous with the amino acid sequence of residues 1-292 of Figure 2, which comprises 

5 the steps of: 

1) providing a DNA segment comprising a polynucleotide capable of expressing said 
polypeptide, 

2) inserting said DNA segment into an expression construct, 

3) transfecting a host cell with said expression construct, and 

10 4) providing the host cell with an environment in which it expresses said polypeptide. 

24. The process of Claim 23 further comprising a step of purifying said enzyme 
after step 4 of said process. 

25. A process for preparing an immobilized enzyme containing a polypeptide 
substantially homologous with the amino acid sequence of residues 1-292 of Figure 2, 

15 covalently linked to a solid support, comprising the steps of: 

1) providing an enzyme containing a polypeptide substantially homologous with the 
amino acid sequence of residues 1-292 of Figure 2, 

2) providing a solid support which is attached to linker having at least one reactive 
group, and 

20 3) contacting said enzyme with said linker under biocompatible conditions in which 

said reactive group reacts with an amino, carboxy, hydroxy, or sulfhydryl group covalently 
attached to said polypeptide to form a covalent attachment. 

26. The process of Claim 25 wherein said linker has at least one group selected 
from among the dialdehydes, diacids, diamines, diisocyanates, cyanates, diimides, and 

25 carbodiimides, provided that a diamine is not used in conjunction with a carbodiimide. 

27. The immobilized enzyme produced according to the process of Claim 25. 

28. A process of converting a halogenated aliphatic hydrocarbon to an alcohol or 
a haiohydrin comprising the steps of: 

1) providing an enzyme containing a polypeptide substantially 
30 homologous with the amino acid sequence of residues 1-292 of Figure 2, 
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2) providing a solid support which is attached to a linker having at least 
one reactive group, 

3) contacting said enzyme with said linker under biocompatible 
conditions in which said reactive group reacts with an amino, carboxy, hydroxy, or 

5 sulfhydryl group covalently attached to said polypeptide to create a covalent 

attachment and form an immobilized enzyme, and 

4) contacting said immobilized enzyme with a halogenated aliphatic 
hydrocarbon under conditions in which said enzyme can convert the halogenated 
aliphatic hydrocarbon to an alcohol or halohydrin. 

10 29. The process of Claim 28 wherein said enzyme is the enzyme of Claim 2 or 3. 

30. An enzyme according to any one of Claims 1-3 wherein said enzyme is a 
fusion protein having one or two, terminal polypeptide tails. 

31 . The enzyme of Claim 30 wherein said fusion protein has a single amino- 
terminal tail of up to 30 amino acids which contains a sequence of at least six contiguous 

15 histidine residues. 

32. The enzyme of Claim 30 wherein said fusion protein has a single carboxy- 
terminal tail of up to 150 amino acids. 

33. The enzyme of Claim 32 wherein said carboxy-terminal tail is hydrophilic. 

34. The enzyme of Claim 30 wherein said fusion protein has both an amino- 
20 terminal tail of up to 30 amino acids and a carboxy-terminal tail of up to 150 amino acids. 

35. The enzyme of Claim 34 wherein said amino terminal tail contains a sequence 
of at least six contiguous histidine residues. 

36. The enzyme of Claim 34 wherein said carboxy-terminal tail contains a 
polypeptide selected from the group consisting of the EXFLAG polypeptide, the S-Tag 

25 polypeptide, hexahistidine-sequence-containing polypeptides, and cellulose binding 
domains. 

37. An enzyme having halogenated aliphatic dehalogenase activity whose DNA 
has been derived, by a directed evolution process, from a related DNA sequence, wherein 
said enzyme has a dehalogenating activity which is greater than that of the related DNA 

30 sequence. 
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38. The enzyme according to Claim 37 wherein said related DNA sequence 
encodes a wild-type or recombinant dehalogenase. 

39. The enzyme according to Claim 38 wherein said related DNA sequence 
encodes a wild-type or recombinant haloalkr^ne dehalogenase. 

40. The enzyme according to Claim 39 wherein said related DNA sequence 
encodes a wild-type or recombinant Rhodouoccus haloaikane dehalogenase. 

41 . The enzyme according to Claim 37 wherein said directed evolution process 
involves performing error-prone PGR. 

42. The enzyme according to Claim 37 wherein the enzyme is a fusion protein 
having one or two terminal polypeptide tails. 

43. The enzyme according to Claim 42 wherein one tail or both tails have been 
modified by a directed evolution process. 

44. An expression vector selected from the group consisting of ATCC 209609, 
ATCC 20961 0, and ATCC 20961 1 . 

45. A cell culture selected from the group consisting of ATCC 

202085, ATCC 202086, and ATCC 202087. 
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Fig. 3 
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